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INTRODUCTION 


NUMBER of stocks of Drosophila melanogaster, some of which arose by 

changes from wild type to Bar eye, and others by reversions or partial 
reversions from Bar, double-Bar or double infra-Bar toward wild type, have 
been accumulated in this laboratory. Most of the stocks originated in this 
laboratory and were obtained by irradiating males with X-rays, mating them 
with normal! females, and selecting and breeding mutant females from the F;. 
A few stocks were obtained by workers in other laboratories. The origin of 
each stock will be given in the descriptions which follow. 

A cytological analysis of these stocks was made with the following aims: 
(1) to determine, if possible, the exact relationship of the Bar locus to the bands 
in the 16A region of the salivary gland chromosome, and (2) to study the re- 
lationship of this locus or band to the loci adjacent to it either in the normal 
chromosore or by virtue of arrangement. 

All the stocks examined were obtained as changes from wild type, Bar, 
double-Bar, or double infra-Bar, and in order to determine correctly the cyto- 
logical changes associated with changes in phenotype, it was necessary to 
make a preliminary study of the Bar, double-Bar, and double infra-Bar stocks 
in this laboratory. 

The original Bar (B) mutant, found by Tice (1914) and established in stock, 
was found by independent examination of the salivary gland chromosomes by 
BRIDGES (1936) and by MULLER, PROKOFYEVA, and Kossikov (1936) to have 
a tandem duplication of the seven bands of the 16A section in the X chromo- 
some, while BripcEs described double-Bar (BB) as carrying these bands in 
triplicate. Infra-Bar (B*), obtained by StuRTEVANT (1925) from Bar, was 
found by BrincEs to have a duplication similar to, if not identical wh, that 
of Bar, while double infra-Bar (B*B*) was associated with a triplication. Sub- 
sequently Rapoport (1940) has built up, through unequal crossing over, a 
Bar series with multiplication of the 16A region up to eight times. 

The Cold Spring Harbor B and BB stocks were found to correspond with 
BRIDGEs’s original analyses. The chromosomes of the B‘B‘ stock were in- 
distinguishable from those of BB. 


TECHNIQUE 


Those changes in the B263 series described below which have serial num- 
bers below 49 were obtained after irradiation of males with X-rays at a dos- 
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age of 2500-3000r. The B263-49 and B263-51 stocks were obtained after 
treatment with rooor. The B reverse series (B rev 1-3) and B partial series of 
BIsHOP were obtained after treatment with either 1ooor, 2000r, or 400or. 

Generally the chromosomes of the mutant stocks were studied in heterozy- 
gous females carrying one set of wild type chromosomes, but in a few cases 
(for example, B™' and B”?) males and homozygous females were also studied; 
and in the case of B263-24, a ring chromosome (X°) was found useful in ob- 
taining unsynapsed figures for analysis. 

Slides of salivary gland chromosomes were stained with aceto-carmine or 
acetic orcein. 

For cytological analysis, a 90 X, 1.3 N.A. apochromatic objective was used, 
with an oil immersed 1.4 N.A. condenser, 12.5 X compensating oculars, and a 
Bausch and Lomb research lamp with the green Wratton filter number 61. 

In the following cytological descriptions, the bands of the salivary gland X 
chromosome will be designated according to BRIDGES’s 1938 map and the 
bands in autosomes according to his 1935 maps. Where a chromosome has two 
or three tandem repeats of the 16A region, homologous bands in the first, 
second, or third repeat regions of the chromosomes will be distinguished by 
superscript numbers, thus: 16A1-2, 16Ar-2, 16Ar1-2®), 

Some of the analyses were made previously by Dr. B. P. KAUFMANN, and 
confirmed by the author. 


DESCRIPTION OF CHANGES AFFECTING THE BAR LOCUS 


Changes from wild type to Bar (see fig. 1) 


B263-47 (DEMEREC 1938, unpub.). From X-rayed y o’. Eyes Bar-like, but 
larger than Bar. Not lethal. Inversion with first break between 16A1-2 and 4 
(position relative to 16A3 uncertain), second break in chromocenter following 
20AI-2. 
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Ficure 1.—Diagram showing 16A region of the normal X chromosome and 
cytological changes associated with changes from wild type to Bar eye. 
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B263-48 (BIsHOP 1939, unpub.). From X-rayed Sw-b co. Eyes Bar-like, 
but larger than Bar. Not lethal. Transposition with breaks between 3E1-2 and 
3:4, between 15F9 and 16A1-2, and in chromocenter following 20A1.2. The 
section 16A1.2—20A1.2 is inserted between 3E1.2 and 3.4. Genetic tests to 
obtain crossover types from B263-48/sc cv v f females, made by BisHop (un- 
published) and confirmed by the author, show that the left end of the re- 
arranged X, with the duplication for 16A1.2—20A1.2, produces the Bar effect, 
while the corresponding deficiency, consisting of the left end of the sc cv v f 
chromosome and the right end of the rearranged X, is not recovered, and is 
therefore lethal in heterozygous condition. 

BM! (MULLER 1935). From X-rayed + co’. Weak allele of Bar. Not lethal. 
Inversion with first break between 16A1.2 and 5.6 (position relative to 16A3 
and 4 uncertain) and second break in chromocenter following 20A3. 

BM? (MULLER 1935). From X-rayed + co’. Weak allele of Bar, eyes rather 
variable, slightly smaller than in B'. Not lethal]. Inversion with first break 
between 16A1.2 and 5.6 (position relative to 16A3 and 4 uncertain) and second 
break in chromocenter to right of nucleolus forming region, about 20E. 


Changes from Bar to wild type (see fig. 2) 
A. Complete reversal 


Df(1){B-257—6 (BRIDGES 1917). Spontaneous from w B o. Eyes wild type. 
Lethal, homozygous and hemizygous. Deficiency for 15F1.2-16A7™ inclusive. 

B263-5 (DEMEREC 1933, unpub.). From X-rayed B o. Eyes wild type. 
Lethal, homozygous and hemizygous. Inversion with first break separating the 
two repeat regions of 16A, and second break between 17A3.4 and 5.6. 
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FicuRE 2.—Diagram showing the 16A repeat in the Bar chromosome and cytological 
changes associated with complete or partial reversals from Bar to wild type. 
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B263-7 (DEMEREC 1933, unpub.). From X-rayed su B oc. Eyes wild type. 
Lethal, homozygous and hemizygous. Stock lost. Only two slides available, 
no change detected in salivary gland chromosomes, but the possibility that 
there is a small deficiency is not excluded. 

B263-20 (DEMEREC 1934, unpub.). From X-rayed sn B o&. Eyes wild type. 
(B263-20/B equivalent to +/B). Lethal, homozygous and hemizygous. De- 
ficiency for both 16A sections, 16A1.2—16A5.6® inclusive (16A7® un- 
certain). 

B263-46 (BATE 1937, unpub.). From X-rayed B o. Eyes wild type. Lethal, 
homozygous and hemjzygous. Deficiency for one 16A section, 16A1—7 in- 
clusive. 

B rev.1 (BisHop, unpub.). From X-rayed B. o’, 1ooor. Eyes wild type. Not 
letha]. Salivary gland chromosomes show no detectable change. 

B rev.2 (BisHop, unpub.). From X-rayed B o, 4oo0or. Eyes wild type. Not 
lethal. Inversion with first break between 3F7.8 and 4A1.2, and second break 
between 16A1.2® and 16A4® (position of 16A3® uncertain). 

B rev.3 (BisHop, unpub.). From X-rayed B <’, 20oor. Eyes wild type. Not 
lethal. Inversion with first break probably between 16A5.6™ and 16A1.2 
and second break in chromocenter following 20A4.5. 


B. Partial reversal 


A cytological examination was made of six stocks with eyes intermediate 
between Bar and wild type, all obtained by Bishop from X-rayed Bar males. 
In all six cases (Bar partial 1, 2, 3, 4, 6 and 7) the chromosomes were appar- 
ently normal. B partial 1 was obtained with a dosage of 2ooor, B partial 3 with 
rooor and the others with 4ooor. 


Changes from double-Bar and double infra-Bar towards wild type 
(see fig. 3) 
A. Complete reversal 


B263-24 (DEMEREC 1934, unpub.). From X-rayed sn B‘B‘ o&. Eyes wild 
type. Lethal homozygous and hemizygous. Complex rearrangement in X with 
three breaks: first between 1oC1.2 and D1.2, second between 12D1.2 and 
E1.2, third between 16A5.6™ and 16A1.2®. New order: (o-10C1.2)—(16A®-— 
12E1.2)—(12D1.2—10D1.2)—(16A®-—20). 

B263-34 (DEMEREC 1934, unpub.). From X-rayed yB‘B‘ o. Eyes wild type. 
Lethal homozygous and hemizygous. Salivary gland chromosomes show no 
detectable change. 

B263-38 (DEMEREC 1934, unpub.). From X-rayed yB‘B‘ o. Eyes wild type. 
Lethal homozygous and hemizygous. Salivary gland chromosomes show no 
detectable change. 

B263-43 (HoovER 1935, unpub.). From X-rayed yB*B‘ o. Eyes wild type. 
Lethal homozygous and hemizygous. Deficiency for two of the triplicated 
16A regions. 
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FIGURE 3.—Diagram showing 16A triplication in the double-Bar and double infra-Bar chromo- 
somes, and cytological changes associated with complete or partial reversals from double-Bar and 
double infra-Bar to wild type. 


B263-51 (SUTTON 1940, unpub.). From X-rayed BB co’, 1ooor. Eyes wild 
type. Not lethal. Salivary gland chromosomes show no detectable change. 

{B257-9 (DEMEREC 1934, unpub.). From X-rayed yB‘B‘ o. Eyes wild type 
({B257-9/B equivalent to +/B), bristles forked. Lethal homozygous and 
hemizygous. Deficiency for 15F1.2-16D1.2 inclusive. 


B. Partial reversal 


B263-28 (DEMEREC 1934, unpub.). From X-rayed B‘B‘ o. Eyes large like 
B‘. Not lethal. Deficiency for 16A4—16A5.6® inclusive (presence of 16A3 
and 16A7® uncertain). 

B263-49 (SUTTON 1940, unpub.). From X-rayed BB a’, 1ooor. Eyes large, 
variable in size. Not lethal. Salivary gland chromosomes show no detectable 
change. 

B* (STEINBERG 1940, unpub.). Spontaneous from BB ao. Not lethal. Eyes 
like B‘, very variable. Salivary gland chromosomes show no detectable change. 


DISCUSSION 
The Bar locus 


It will be seen from the charts in figures 1-3 that in Bar changes of all classes 
where rearrangements are involved, the breakage points are confined to the 
left half of the 16A section following 15F9 and preceding 16A5.6. For the 
purpose of locating the Bar gene, the changes which involve heterochromatin 
may be disregarded for two reasons: they are more difficult to analyze pre- 
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cisely, and, besides, in such rearrangements heterochromatin is known to 
influence loci not immediately adjacent to the breaks (DEMEREC 1941). There 
remain six known and carefully analyzed changes shown in the charts— 
namely, B263—5, B263-24, B263-48, B rev.2, B®‘ (DOBZHANSKY 1932) and the 
original Bar duplication. In four of these changes, the chromosome has been 
broken immediately to the left of 16A1.2, in one (B*) the break separates the 
two halves of the 16A1.2 doublet, while in the sixth (B rev.2) the break may 
be immediately to the right of 16A1.2, although its position relative to 16A3 
is uncertain. 

This distribution of breaks suggests that the affected locus is associated with 
the 16A1.2 band. The strongest evidence for this has been derived by GRIFFEN 
(1941) from experiments with the T(1;4) B* stock. In this translocation the 
two 16A sections of the Bar duplication are separated, the left section being 
attached to the proximal part of 4, while the distal end of 4 is attached to the 
right 16A section, immediately preceding 16A1.2®. GRIFFEN found that the 
Bar effect in this stock is confined to the 4-XR chromosome, where material 
of 4 is attached next to 16A1.2°. The XL-4 part has no Bar effect, but such 
an effect was induced through a deletion of the whole region between 3C2.3 
and 16A1.2™, which brought 16A1.2™ into a new association with the 3C2.3 
band. 


The Bar position effect 


The Bar locus has no detectable phenotypic expression in the normal 
chromosome, for in heterozygous condition, a chromosome deficient for this 
locus (as in B263-—20) is equivalent in its phenotypic effect to a normal chromo- 
some, and as STURTEVANT (1925) has already pointed out, the hemizygous 
male with one normal X chromosome is also wild type, not Bar. 

It has been made clear (MULLER 1936) that the Bar effect is not caused 
simply by increased dosage of the Bar gene, but is due to a position effect. 
DoBZHANSKY (1932) and DuBININ and VoLorTov (1936) believe that the Bar 
locus produces the Bar effect when it is adjacent to certain other genes which 
are relatively specific in their interaction with this locus. It is conceivable that 
the Bar locus is incapable per se of producing the Bar effect, either with or 
without intragenic mutation but requires a specific interaction between this 
and some other locus. Although one cannot exclude the possibility that a Bar 
effect might be produced by intragenic mutation at the Bar locus, the rest of 
this hypothesis seems probable for the following reasons: 

1. No Bar type without a visible rearrangement has ever been obtained 
from wild type flies. If the Bar gene were effective per se, but inhibited by an 
adjacent locus in the normal chromosome, one would expect that a “point 
mutation” might change this inhibitor in such a way as to allow the Bar gene 
effective expression without rearrangement. 

2. The relative frequency of breaks in the Bar region and changes from wild 
type to Bar might be expected to give some evidence for or against the hy- 
pothesis. Thus, if the Bar gene requires a specific activator, only a few of the 
possible rearrangements with breaks adjacent to the Bar locus will result in a 
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Bar phenotype; while if it is merely inhibited by a neighboring locus, almost 
any break separating it from this locus might result in a Bar effect. (It would 
become impossible to distinguish between the two cases, of course, if such 
“activators” or “inhibitors,” as the case may be, were very numerous and 
distributed throughout the chromosomes.) 

DvuBInIn and Vo.Lorov (1936) state that “breaks in the 16A section of the 
X chromosome occur much oftener than the cases of Bar de novo.” The exact 
positions of breaks have not been determined by these workers, and the residue 
of breaks in the 16A section may be ineffective because they are not imme- 
diately adjacent to the Bar locus. The question could be answered statistically 
by a detailed cytological analysis of a large sample. 

A very rough estimate of the theoretical expectation of Bar changes from 
breaks adjacent to the Bar locus may also be made. According to FANo (1941), 
if it be assumed that breaks are evenly distributed in the euchromatic regions, 
there is about one chance in 4000 of breakage immediately adjacent to any 
given locus, in sperm irradiated at 300or. If only breaks which recombine with 
euchromatin are considered, the chances are about 1/5000. FANo further cal- 
culated that the number of Notch changes obtained from an estimated total 
of 740,000 gametes investigated was only about 10 percent of that expected 
if all breaks immediately adjacent to the Notch locus were effective in pro- 
ducing a change in that locus. 

In the case of Bar, we can make no close estimate of the total] number of 
gametes effectively investigated in this and other laboratories. It should, how- 
ever, be of about the same order as the total for the Notch locus given above. 
Only three cases of Bar de novo involving rearrangements with euchromatin 
have been recorded (the original Bar, B**, and B263-48), as compared with 
11 Notch changes. This would indicate that the “sensitivity” of the Bar locus 
to the occurrence of adjacent breaks is perhaps lower than that of the Notch 
locus by a factor of 10, and differs from it in degree rather than in kind. 

3. Further, if the Bar effect depends on the contiguity of specific activating 
genes, the reversal from Bar to wild type could occur either through any 
rearrangement separating the two interacting loci, or by a mutation at the 
Bar locus, or by a mutation at the specific locus adjacent to Bar. Accordingly 
reverse mutation of Bar should be far more frequent than the origin of Bar 
de novo. . 

DvuBININ and VoLotov obtained one change per 382 chromosomes from 
irradiated Bar flies (dosage not stated), and 23 such changes have been ob- 
tained in this laboratory, while the frequency of occurrence of Bar de novo 
(of which there are only seven cases recorded) must be very much lower. 

If, then, the Bar phenotype is considered as due to interaction between two 
loci (or between their products) cases of reversal or partial reversal to wild 
type, without a cytological change (such as B263-7, B rev.1, B263-38, B263- 
49 and B*°) may be considered as due to intragenic mutation either of the Bar 
locus or of the adjacent interacting locus. In one such case (B rev. No. 44) 
VoLotTov (1937) was able to show that the Bar locus remained unchanged, 
and it was the specific locus adjacent to it, presumably, which was affected. 











104 EILEEN SUTTON 


In several rearrangements with new breaks in the Bar region and reattach- 
ments of new loci in that region, the Bar effect was restored. 

In the case of B rev.2, however, the break is to the right of the 16A1.2 
band (that is, nearer to the Bar locus than to the other locus involved), and 
the association of bands 16A7-—16A1.2@ is undisturbed. This may be inter- 
preted as an example of breakage and simultaneous mutation of the Bar locus, 
rather than mutation of its neighbor. This possibility has not been tested, 
however, by VoLotov’s method of trying to restore the Bar phenotype by 
inducing new rearrangements. 

A phenomenon which Durnin and Votorov (1936) found also is the com- 
plete reversal to wild type from double infra-Bar and double-Bar (B263-—34 
and B263-51). The first of these reversals was also associated with a lethal 
effect. This could be interpreted as a simultaneous change in two loci separated 
by several bands. In another complete reversal] from B‘B‘ (B263-24), according 
to this interpretation, a rearrangement which destroyed the Bar-producing 
association 16A7—16A1.2@ was combined with a change of effect in the 
association 16A7—16A1.2 and with a lethal effect. It has been shown in the 
case of the Notch locus (DEMEREC and FANO 1941) that deficiencies equivalent 
to 1-15 bands in the salivary gland chromosomes may be due to a “single event” 
process—for instance, the excitation produced by a single electron hit. One 
case has been described in which two breaks and two gene changes, spread over 
a region equivalent to four bands of the salivary gland chromosome, were 
probably produced in the course of such a “single event” process. It is not 
excluded, therefore, that the double reversals referred to above may have 
originated in a like way by simultaneous mutation of two loci. On the other 
hand, the phenomenon might well be due to a single mutation at the Bar locus 
or closely linked with it, having the effect of a suppressor of double-Bar or 
infra-Bar. 

Table 1 shows a comparison of the data of this paper with those of VoLoTov 
(1937). The types obtained correspond fairly closely, with one exception—no 


TABLE I 


A comparison of the data of VoLoTov (1937) in columns headed (a) with the data presented in figures 
1-3, in columns headed (b). 





ABERRATIONS: ABERRATIONS: MUTATION: 
TYPE OF TOTAL BREAK NO BREAK DEFICIENCY NO ABER- 
CHANGE IN 16A IN 16A RATION 


(a) (b) (a) (b) (a) (b) (a) (b) (a) (bd) 





+ to Bar 8 6 8 6 — _ _ - _- _ 
B to Bt or B- 17 6 4 — 6 — _ — 7 6 
Bto+ 26 8 8 3 I _ I 3 16 2 
BB toB 15 3 7 _ _ _— _ I 8 2 
BB to + 9 6 5 I _ _ ~— 2 4 3 


Total 75 29 32 10 7 — I 6 35 13 
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equivalent was found for the reversals obtained by VoLotov with aberrations 
in which all breaks were at a considerable distance from the Bar region. 
Votorov attributes these Bar changes to a remote position effect, but it is 
possible that the Bar change and the rearrangement are independent effects of 
different X-ray hits. This would be expected after irradiation with high dosage, 
but the dosage with which VoLorov’s stocks were obtained is not given. 


Genes interacting with the Bar locus 


If we adopt the concept that the Bar effect is due to an interaction between 
the Bar locus and certain other specific loci, strictly limited in number (or be- 
tween their products), some consideration must be given to the nature and 
function of these loci in the normal chromosome as well as in the rearranged 
chromosomes of Bar phenotypes. 

Bands which are known to be adjacent to the Bar locus (assumed to be 
represented by the band 16A1.2) in known Bar stocks, are the following: 
3E1.2 (B263-48), 16A7 (Bar duplication of Tice), 48C3.4 (B™ of Dos- 
ZHANSKY), 102F 2 (B*) and 3C2.3 (Bar of GriFFEN derived from B*); and also 
heterochromatin of the X chromosome (B™!, B!?, B263-47). In the Bar de- 
scribed by DuBININ and Gotpat (1936) the Bar effect may be due to hetero- 
chromatin either of X or of the second chromosome, the exact position of the 
break with respect to the Bar locus being undetermined. None of these bands 
with the exception of 3C2.3 represent loci of known mutants. The available 
evidence indicates that 3C2.3 represents the white locus. If a number of similar 
Bar changes could be obtained, in which the expression of the locus next to 
Bar was known to some extent, it might be possible to deduce something as 
to the nature of the gene interaction causing the Bar effect. 

It seems that some at least of the heterochromatic loci can enter into a 
specific reaction with the Bar locus similar to that of some euchromatic loci. 
Heterochromatin is known to have a non-specific effect of inhibiting the normal 
expression of genes transferred to its vicinity, producing a result similar to 
deficiency. The Bar cases considered here do not correspond to Bar deficiencies, 
but are, on the contrary, evidence of a specific effect of heterochromatin in 
stimulating a changed reaction in the Bar locus. Since it has been shown that 
these may also be interstitial heterochromatic regions (see KAUFMANN 1941), 
we cannot eliminate the possibility that heterochromatin is the effective agent 
even in Bar rearrangements not involving the chromocenter. 

In B rev.3 also heterochromatin of X is adjacent to the 16A1.2 band, yet 
the eye is completely wild type. One cytological difference between this stock 
and the Bar eyed stocks in which heterochromatin is involved is that the break 
(as far as can be accurately determined) is to the left instead of to the right 
of 16A1.2 in B rev.3, and this band is transferred to the chromocenter instead 
of having a small block of heterochromatin transferred to its neighborhood. 
This difference may be of some significance. On the other hand the absence of 
a Bar effect in this case may be due to a mutation in the Bar locus induced 
simultaneously with the break at 16Ar.2. 

The mode of action of the different loci affecting Bar is probably variable. 
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In fact, developmental studies (LUCE 1926; SEYSTER 1919; DOBZHANSKY 1932) 
show that the standard Bar eye of Tice, which is smaller at high temperatures, 
reacts differently from the infra-Bar or baroid eyes, both of which are smaller 
at low temperatures; and further studies would probably bring out other 
differences of this kind, which might be attributed to differences in the adjacent 
loci of the various Bar stocks. 

SUMMARY 


The Bar locus in Drosophila melanogaster is believed to be related to the 
band 16A1.2 in the salivary gland X chromosome. 

Comparison with a Bar deficiency shows that in its normal] position the Bar 
locus has no effect on the phenotype of the fly. 

The Bar effect is apparently produced by the interaction of the Bar 
locus, or substances produced by it, with certain other specific loci or gene 
products, when the Bar gene is brought into immediate contact with one of 
these loci. Several such loci are known. Their reactions with the Bar gene 
may differ considerably. 

The Bar effect is not essentially different from other known position effects. 

The Bar effect may be destroyed by mutation of one of the two interacting 
loci, as well as by separation of these loci through chromosomal rearrangement. 


LITERATURE CITED 


Brinces, C. B., 1917 Deficiency. Genetics 2: 445-465. 
1936 The Bar “gene’’ a duplication. Science 83: 210-211. 
1938 A revised map of the salivary gland X-chromosome of Drosophila melanogaster. 
J. Hered. 29: 11-13. 

DeMEREC, M., 1941 The nature of the gene. Cytology, Genetics and Evolution. Univ. Penn., 
Bicent. Conf.: 1-11. 

DeMEREC, M., and U. Fano, 1941 Mechanism of the origin of X-ray induced Notch deficiencies 
in Drosophila melanogaster. Proc. Nat. Acad. Sci. 27: 24-31. 

DeEMEREC, M., and E. Sutton, 1940 Unequal breaks in two sister chromatids induced by X-rays 
in Drosophila melanogaster. Proc. Nat. Acad. Sci. 26: 532-536. 

DoszHANSKY, T., 1932 The baroid mutation in Drosophila melanogaster. Genetics 1'7: 369-392. 

Dusiny, N. P., and S. J. Gotpat, 1936 A new case of Bar mutation. Biol. Z. Moskau. 5: 881- 
884. 

Dusrnin, N. P., and E. N. Vototov, 1936 A study of mutations arising at the Bar locus in 
Drosophila melanogaster. Bull. Biol. Med. Exp. 1: 327-329. 

Fano, U., 1941 On the analysis and interpretation of chromosomal changes in Drosophila. 
Cold Spring Harbor Symp. Quant. Biol. 9: 113-120. 

GriFFEN, A. B., 1941 The B* translocation in D. melanogaster and modifications of the Bar effect 
through irradiation. (Abstract) Genetics 26: 154. 

KAUFMANN, B. P., 1941 Induced chromosomal breaks in Drosophila. Cold Spring Harbor Symp. 
Quant. Biol. 9: 82-92. 

Luce, W. M., 1926 The effect of temperature on infra Bar, an allelomorph of bar eye in Droso- 
phila. J. Exp. Zool. 46: 301-316. 

MULLER, H.J.,1935 The position effect as evidence of the localization of the immediate products 
of gene activity. Fifteenth Int. Physiol. Cong., Leningrad. 
1936 Bar duplication. Science 83: 528-530. 

Mu_ter, H. J., A. A. PROKOFYEVA-BELGOvsKaya, and K. V. KossiKov, 1936 Unequal crossing- 
over in the Bar mutant as a result of duplication of a minute chromosome section. C. R. 
Acad. Sci. U.S.S.R. 1: 87-88. 





BAR EYE IN DROSOPHILA 107 


Rapoport, J. A., 1940 Multiple linear repetitions of chromosome blocks and their evolutionary 
significance. J. Gen. Biol. 1: 235-270. 

SEYSTER, E. W., 1919 Eye facet number as influenced by temperature in the bar-eyed mutant 
of Drosophila melanogaster (ampelophila). Biol. Bull. 37: 168-182. 

SturRTEVANT, A. H., 1925 The effects of unequal crossing-over at the Bar locus in Drosophila. 
Genetics 10: 117-147. 

Tice, S. C., 1914 A new sex-linked character in Drosophila. Biol. Bull. 26: 221-230. 

Vototov, E. N., 1937 A cytogenetic study of the Bar mutations in Drosophila melanogaster. 
Biol. Z. Moskau. 6: 907-922. 











PSEUDO-DEFICIENCIES AND TRANSLOCATIONS OF 
CHROMOSOME TIPS IN DROSOPHILA 


RICHARD GOLDSCHMIDT anp MASUO KODANI 
University of California, Berkeley 


Received October 21, 1942 


ROM the observation that broken ends of chromosomes will seek to unite 

with other broken ends rather than become healed MULLER (1932) con- 
cluded that the free ends of chromosomes must contain a structural element, 
necessary for the existence of the chromosome as a unit, which he called a 
telomere. If this is true, the existence of deficiencies or inversions at the tip 
of the chromosome becomes an impossibility. There are some exceptions which 
Muller concedes and considers to be “mutations of an interstitial gene into a 
terminal one.” (This is not the place to criticize this use of the words gene and 
mutation for a chromosomal structure and its change; it is objectionable, we 
think, to speak of a structural feature like the telomere in terms of genes, 
whatever idea of the gene we may entertain.) Actually, truly terminal defi- 
ciencies have been found not only in maize but also in Drosophila (DEMEREC 
and Hoover 1936, SUTTON 1940), some of Drosophila deficiencies being in 
use as standard tester stocks. The salivary structure is in these cases (re- 
checked by us, though this was unnecessary) unequivocal (about others see 
below). 

Meanwhile the junior author (KoDANI 1942) demonstrated the actual exist- 
ence of a visible structure at all free ends of the salivary chromosomes in 
D. melanogaster which might represent MULLER’s theoretical telomeres. By 
proper chemical treatment the end disk uncoils—as all bands do—into a num- 
ber of so-called perultimate chromomeres which are distinguishable from those 
of other bands by their size, smaller number, and different solubility. These 
findings drew our attention to the general problem of the telomere and made us 
reconsider the interpretation of a set of observations which we had made in 
another line of work and had originally interpreted as small terminal defi- 
ciencies and translocations. This reconsideration was also favored by the work 
of Hinton and Atwoop (1941) who described terminal adhesions of the free 
ends of the salivary chromosomes as a typical feature with varying percentages 
for different chromosomes and stocks. The junior author had frequently seen 
the same thing and we could now attribute the phenomenon to a special 
property of the visible telomeres, which one might call stickiness (see GoLp- 
SCHMIDT and KoDANI 1942). 

The facts to be reported here were found when certain stocks belonging to 
or derived from a mutable stock were carefully scrutinized for the smallest re- 
arrangements. What looked like terminal deficiencies and small translocations 
were found in different chromosomes of this stock as a typical feature (they 
were also found in a homozygous condition), and we tried in vain to relate them 
to definite genetical facts. The most frequent case was a deficiency of the first 
four bands in the X chromosome accompanied by a small translocation to the 
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tip of IIIR. As Sutton (1940) had found that even a two-band deficiency here 
had a yellow heterozygous effect (if opposite yellow), numerous tests were 
made for y and ac (actually before SutTon’s paper had appeared, and repeated 
afterwards), which were always negative. However DEMEREC and HoovER 
found a four-band deficiency, morphologically identical to the one mentioned 
above, which had no visible effect; a strange and unusual disagreement with 
SuTtTon’s findings—the importance of which has been discussed in a paper 
now in press. Our suspicions were aroused when we found the same tip defi- 
ciencies in other stocks, even in wild stock though less frequently, and we 
finally came to the conclusion that we were studying pseudo-deficiencies, 
caused by mechanical breakage of attached tips during the procedure of 
smearing. This interpretation would throw suspicion upon apparently similar 
cases described by others, as will be discussed below. The following observa- 
tions suggest that no special explanation may be needed for such cases. 


TABLE I 
STOCKS 
px bl poi poire 

Heterozygous matings 17 10 5 
Homozygous matings 6 2 3 
Slides heterozygous 9? 49 59 20 
Slides homozygous 9 18 8 6 
Slides heterozygous @ 30 26 10 
Slides homozogyous o 12 I 4 
Rearrangement in heterozygous 9 found in n 
glands (but not all nuclei), o glands in parentheses. 

1. Df(1)1A1-4 22 glands 42 glands 5 glands 

2. Df(2R)60F 4-5 20(8) 9(4) 2(2) 

3. T(3L)61A 7(1) 17(2) 2 

4. T(3R)100F 10(5) 38(13) 5(4) 
Rearrangement in homozygous 9 (<") 

3. T(3L)61A 5(1) 

4. T(3R)100F 3 I 
Together in same gland, rearrangement No. g 

t, & 34 3 times 3 times 

t,.& 4 I 2 

1, 2,4 3 ss I 

2, 3,4 6 I 

1,3,4 ra a 

1,2 4 I I 

2,3 I I 

2,4 3 4 

I, 3 I 3 

1,4 2 25 


Table 1 contains the results of a statistical survey carried out for three 
stocks marked px bl, poi, and poi? *'. In this survey the four types mentioned 
in the tables were found: (1) Df(1)1A1~—4, absence of the first four bands at the 
tip of the X chromosome; (2) Df(2R)6oF4-5, absence of two bands at the tip 
of IIR; (3) T(3L)61A, an apparent translocation of the tip of the X chromo- 
some to the end of IIIL, that is, the same as described by MULLER; and (4) 
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T(3R)100F, that is, the same attachment to IIIR. In other slides, not belong- 
ing to the series made for statistical study and thus not contained in the table, 
tip deficiencies for IIL, IIR, and IV were also found. These are pictured in 
figures 4, 9, 11. The procedure in the study to which the table refers was this: 
A number of crosses were made in both directions between the three stocks and 
Oregon wild. For each cross a number of one-gland slides were made, using 
female as well as male larvae. Further, a number of pairs of the pure stock 
were mated and handled in the same way, and for all these slides the presence 
of the four types of “deficiencies” or “translocations” in any nucleus of a gland 
was marked. All those reported in the table, including those marked in the 
column of homozygous slides, were heterozygous, that is, one tip only was 
affected. However, the homozygous condition (both tips) was frequently 
found, though less so than the heterozygous condition (see figs. 6, 8, 12). This 
difference may not be.significant, as the homozygous condition might be over- 
looked in more contracted chromosomes. 

The number of nuclei within one gland which showed the deficiencies and 
translocations was highly variable, as was also the number of these aberrations 
in a single nucleus. The most frequently found types (Df(1) and T(3R)) oc- 
curred in most nuclei of a gland simultaneously but normal nuclei were always 
found (about 10 percent). The less frequent types (2, 3 in table 1) were less 
often found together. Again the combinations of Df(1) and T(2)R or T(3L) 
were more frequent. 

Table 1 shows a very high incidence of the pseudo-deficiencies and trans- 
locations of the type 1 and 4. Whether the differences between the stocks are 
significant is difficult to tell. It is remarkable that the heterozygous changes 
were rather rare in the homozygous stocks though the homozygous condition 
was present (not marked in the table). This indicates that breakage is more 
easily accomplished in our mutant stocks than in the wild stock. 

We anticipated the conclusion that we were dealing with pseudo-deficiencies 
and translocations. Our conclusion is that the cohesion of the chromosome tips 
(by stickiness of the telomeres) is stronger than the cohesion within the chro- 
mosome itself near the tip and that therefore, in smearing the preparations, 
the tips with two to four bands are broken off and become stuck to the ends of 





EXPLANATION OF FIGURES I-12 


FicurE 1.—Association and pulling apart of tips of X and IIIR (poi/+). 
FIGURE 2.—Pseudo-deficiency of X A1-4 with thick terminal band (poi/+). 
FIGURE 3.—The same. 

FiGuRE 4.—Pseudo-deficiency tip IIL (new dp/+). 

FiGuRE 5.—Pseudo-translocation to tip of IIIL (poi/+). 

FicuRE 6.—Homozygous pseudo-deficiency tip of X (poi). 

FicurE 7.—Pseudo-deficiency IIR tip (poi/+). 

FIGURE 8.—Homozygous pseudo-translocation to IIIR tip (px bi). 
FicurE 9.—Pseudo-deficiency tip IIIR in wild stock (Canton X Oregon). 
FicurE 10.—Pseudo-translocation X to IIIR (poi/+). 

FIGURE 11.—Pseudo-deficiency tip of IV (new giant/+). 

FIGURE 12.—Homozygous pseudo-translocation to tip of X (px 6/). 
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other chromosomes. The data of the table indicate the highest degree of sticki- 
ness between X and IIIR, and the lowest cohesion within the chromosome 
near the tip of X. If this interpretation is correct, the apparent translocations 
ought to be always in inverted position. Unfortunately this could not be 
checked since the bands in the “translocated” piece of the X chromosome were 
too faint for reliable seriation. But a condition such as is pictured in figure 5 
looks like an inverted translocation. 

Such pictures as the one figured in figure 1 seem to show how the tip of a 
chromosome is pulled off under mechanical stress after sticking to another tip. 
In figures 2-10 the appearances of these pseudo-deficiencies and translocations 
are reproduced (for description of aberrations see explanation of figures). It 
was observed that rather frequently, though not always, the half-band adjoin- 
ing the break was much thicker than the other half. MULLER might have 
interpreted this (in case of a genuine deficiency) either as a translocation of a 
telomere from another chromosome or as a “mutation” of an ordinary band 
into a telomere. What the phenomenon means in our case of pseudo-deficiency 
is difficult to say. We assume that in such a case only three bands have been 
broken off and that the interval between the fourth and fifth was ruptured by 
the stress and consequently the fourth and fifth bands appear as a single thick 
one. We shall try to make special checks on the telomeres with KopANI’s 
method, if this is technically possible. The illustrations contain also a number 
of heterozygous and homozygous pseudo-deficiencies and translocations not 
found in the table. They are described in the explanation of the figures. 


DISCUSSION 


The interpretation of our findings as pseudo-deficiencies or translocations is 
based upon the frequency of these tip changes in many unrelated stocks, the 
certainty of their presence in only a part of the nuclei of a gland, Hinton and 
Atwoop’s findings concerning the union of chromosome tips, and upon such 
pictures as are presented in figure 1. (To this might be added that BripGEs 
once told the senior author, when discussing the variable aspect of the tip of 
X in standard slides, that in his experience this tip was easily broken off in the 
handling of the chromosomes.) It may be asked whether our conclusion can 
be extended also to cases described as real rearrangements. Thus, MULLER 
(1941) found a scute-J4 with a terminal deficiency and the missing piece ap- 
parently attached to the end of IIIL. MuLLeR calls this a double miracle (from 
the standpoint of his theory of the telomere) and postulates that a minute and 
invisible end piece of III must have been attached to the tip of X. We suspect 
this case to be in the same category as ours. DEMEREC and Hoover’s four-band 
deficiency of the X is present in all nuclei. This is a genuine deficiency if there 
is no indication that the missing four bands have been translocated to the 
tip of IIIR. Among the tip deficiencies described by BripcEs in D.I.S. No. 9 
one, Df(z)Ore®, is described as “apparently” homozygous, two others are de- 
scribed as homozygous (Df(2)Sw” and Df(2)Sw*), and two others (Df(3)D*H® 
and Df(3)Mz*) are mentioned only as observed in slides. All of these ought 
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to be rechecked nucleus by nucleus, as a number of clear findings in one gland 
might have prevented searching for normal nuclei. 


SUMMARY 


Apparent small endwise deficiencies and translocations in Drosophila were 
found to be only pseudo-deficiencies and translocations, produced by mechani- 
cal breaks near the chromosome tip following cohesion of the telomeres at the 
tips of two chromosomes. Different chromosomes and strains are more or less 
apt to undergo such subterminal breaks. 


LITERATURE CITED 


DEMEREC, M., and M. E. Hoover, 1936 Three related X chromosome deficiencies in Drosophila. 
J. Hered. 27: 206-212. 

GotpscumipT, R., and M. Kopani, 1942 The structure of the salivary gland chromosomes and 
its meaning. Amer. Nat. 76: 529-551. 

Hinton, T., and K. C. Atwoop, 1941 Terminal adhesions of salivary gland chromosomes in 
Drosophila. Proc. Nat. Acad. Sci. 27: 491-496. 

Kopant, M., 1942 The structure of the salivary gland chromosomes of Drosophila melanogaster 
II. Effect of urea-alkali mixtures and other chemicals on the structure of salivary gland 
chromosomes. J. Hered. 33: 115-133. 

MULLER, H. J., 1932 Further studies on the nature and causes of gene mutations. Proc. Sixth 
Int. Cong. Genetics. 1: 213-255. 

1941 Induced mutation in Drosophila. Cold Spring Harbor Symp. Quant. Biol. 9: 151-165. 

Sutton, E., 1940 Terminal deficiencies in the X chromosome of Drosophila melanogaster. 
Genetics 25: 628-635. 











ISOLATION BY DISTANCE* 


SEWALL WRIGHT 
The University of Chicago' 


Received November 9, 1942 


TUDY of statistical differences among local populations is an important 

line of attack on the evolutionary problem. While such differences can 
only rarely represent first steps toward speciation in the sense of the splitting 
of the species, they are important for the evolution of the species as a whole. 
They provide a possible basis for intergroup selection of genetic systems, a 
process that provides a more effective mechanism for adaptive advance of the 
species as a whole than does the mass selection which is all that can occur 
under panmixia. 


RANDOM DIFFERENTIATION UNDER THE ISLAND MODEL 

Mathematical consideration requires the use of simple models of population 
structure. The simplest model is that in which the total population is assumed 
to be divided into subgroups, each breeding at random within itself, except for 
a certain proportion of migrants drawn at random from the whole. Since this 
situation is likely to be approximated in a group of islands, we shall refer to it 
as the island model. 

The gene frequency (q) of a subgroup tends to vary about a certain equi- 
librium point (4) in a distribution curve (¢(q)) determined by the net sys- 
tematic pressure (measured by Aq, the net rate of change of gene frequency 
per generation from recurrent mutation, immigration, and selection) in con- 
junction with the cumulative effects of accidents of sampling (random devia- 
tion 5q, variance per generation oj.) (WRIGHT 1929, 1931, 1942). 


- 
— 
~~ 


¢(q) = (C/oda) exp [2 f arciaraa 


Let N be the effective size of the subgroup, m the effective proportion of its 
population replaced in each generation by migrants, and q; the gene frequency 
in the total population. The rate of change of gene frequency per generation in 
a subgroup, taking account only of immigration pressure, is Aqg= —m(q—4q). 
In a random breeding population oj,=q(1—q)/2N. Substitution in (1) gives 
the following, choosing C so that /}¢(q)dq=1 (WRIGHT 1931, 1942). 

r'(4Nm) 


(2) $(q) — qrreerhs a q)*Nm(i-av)—1 
l'(4Nmq.)l[4Nm(1 — q)] 


1 
f q¢(q)dq = a 
0 


* A portion of the cost of composing the mathematical formulae is borne by the Galton and 
Mendel Memorial Fund. 

1 Acknowledgment is made to the Dr. WALLACE C. and Ciara A. ABBOTT MEMORIAL FUND 
of the UNrversiTy oF Cu1caco for assistance in connection with the calculations. 
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1 

(4) = f (q — q)*o(q)dq = q(t — qr)/(4Nm + 1). 
0 


In the derivation of (1), it was assumed that Aq is sufficiently small that 
terms involving (Aq)* might be ignored. A more accurate value of o; may be 
obtained directly. The deviation of a local gene frequency from the average; 
(q—q:), tends to be reduced to (1—m)(q—qt) in the next generation. The 
mean sampling variance of (q+ Aq) is 


(5) I lq — m(q — q)][t — q + m(q — q:) ]6(q)dq 


= [qe(z — qr) — (1 — m)*05]/2N. 


Thus with a steady balance between the effects of immigration and of the 
accidents of sampling 


(6) a =(1- m)*o, + [qe(x -— qq ~ = m)*o3]/2N 
(7) oy = a(t — q)/[2N — (2N — 1)(1 — m)?]. 


This is approximately the same as (4) for small values of m but becomes 
qt(1 — qt)/2N, the sampling variance, in the limiting case of no isolation what- 
ever (m=1). This is about twice as great as given by (4) in this extreme case. 

The variance, excluding the immediate sampling variance may be obtained 
by multiplying (7) by (1—m)? as indicated in (6). Formula (4) lies between the 
values with and without the immediate sampling variance. 

Under exclusive uniparental reproduction, whether vegetative or by self- 
fertilization, the distribution of alternative genotypes may be treated by the 
same theory except for replacement of 2N by N. Immigration pressure is the 
same but the sampling variance is q(1—q)/N. 


THE INBREEDING COEFFICIENT 


Departures from panmixia may be expressed in terms of the average in- 
breeding coefficient of individuals, relative to the total population under con- 
sideration. This coefficient has been defined as the correlation between uniting 
gametes with respect to the gene complex as an additive system. It has been 
shown that its value can be found for any pedigree by finding all paths by 
which one may trace back from the egg to a common ancestor (A) and thence 
forward to the sperm along a wholly different path. According to the theory 
of path coefficients, the correlation between uniting gametes is the sum of 
contributions from all such paths (WRIGHT 1921, 1922b). 


(8) F = >> [(1/2)"st+=>+1(1 + Fa)] 


where F and Fy, are the inbreeding coefficients of the individual and of a 
common ancestor of sire and dam, respectively, and ns and np are the numbers 
of generations from sire and dam, respectively, to this common ancestor. In a 
population in which the average inbreeding coefficient is F, the frequencies 
of genotypes (one pair of alleles) are as follows (WRIGHT 1921, 19224). 
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Genotype Frequency 
( ) AA ~_m = qe?(1 — F) + qF 
9 
Aa vs = 2qr(1 oi qt) (1 — F) 


(1 - qt)?(1 — F)+ (1 = qt) F 


Il 


aa Zt 


The inbreeding, measured by F, may be of either of two extreme sorts: 
sporadic mating of close relatives with no tendency to break the population 
into subgroups, and division into partially isolated subgroups, within each of 
which there is random mating. The latter is the case in which we are primarily 
interested here. Assume that there are K subgroups each of size N. The pro- 
portion of heterozygotes within a subgroup is 2q’(1—q’) where q’ is the gene 
frequency in the parental generation, including immigrants. 


(10) yt = 2>>q/(1 — q’)/K = 2g — 2(>. q’”)/K. 


The variance of the gene frequencies of the subgroups, not allowing for 
accidents of sampling in the last generation, is 


K 
(11) ov = D (q’ — q)*/K = (20 q”)/K — gt 

1 
(12) yt = 2q¢(1 — qt) — 20, from (10) and (11) 
(13) oy = q(t — q:)F from (9) and (12). 


This formula does not allow for the contribution to variance due to acci- 
dents of sampling in the last generation. Thus it gives o4=o instead of 
o3=qt(1—q:)/2N for F=o. To compare with (7) it must be divided by 
(1—m)?. 


(14) o, = q(t — qi) F/(1 — m)? 


(15) F = (1 — m)*/[2N — (2N — 1)(1 — m)?] from (14) and (7) 
(6) m=1~V2NF/[GN- DF Fi] 


(17) o, = q(t — qi) [(2N — 1)F + 1]/2N. 


The formula F=1/[4Nm-+1] given in a preceding paper (DoBzHANSKY and 
WRIGHT 1941) is a satisfactory approximation if m is small. 

This island model is not likely to be exactly realized in nature. In most cases, 
the actual immigrants to a population come from immediately surrounding 
localities in excess and thus are not a random sample of the species. This can 
be remedied to some extent by multiplying the proportion of replacement by 
an appropriate factor to obtain the effective immigration index. If qm is the 
gene frequency in the actual immigrants (varying from group to group) the 
appropriate factor would be (q—qm)/(q—q:). Unfortunately the values for 
effective m for different loci may be very different. 
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LOCAL INBREEDING IN A CONTINUOUS AREA 


At the opposite extreme from the island model is that in which there is com- 
plete continuity of distribution, but interbreeding is restricted to small dis- 
tances by the occurrence of only short range means of dispersal. Remote popu- 
lations may become differentiated merely from isolation by distance (WRIGHT 
1938, 1940). 

Each individual has its origin at a particular place. Assume that its parents 
originated at distances from this place with a certain variance both in longitude 


‘ 
F, 
ee ay 


P gl as 
cis . * 
Gametes Oe ee a 


\- 49 
Zygotes oO 


Gametes OKf7%o 


Proba bility i/N (N-1) /N 


and in latitude. If the same condition held in preceding generations, the grand- 
parents originated at distances with twice this variance in longitude and in 
latitude and the ancestors of generation K originated at distances with K 
times this variance in both directions. The parents may be considered as if 
drawn at random from a territory with a certain radius R and effective popu- 
lation size N. The ancestors of generation K may then be considered as drawn 
similarly from a territory of radius \/K R and effective population size KN. 

We shall use the term parental group for the population (effective size N) 
from which the parents of an individual may be considered to be drawn; the 
term random breeding or panmictic unit will be used for any local population 
of the same effective size as the parental group. 

The assumption of random union of gametes, including self fertilization 
(probability 1/N) can be made with sufficient accuracy even though there is 
actually no self fertilization. It has been shown that such unions in a population 
of constant size N lead to fixation at the rate 1/2N in comparison with the 
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rate [((N+1)—\/(N?+1)]/2N either in a population of size N equally divided 
between males and females or in a population of N monoecious individuals in 
which self fertilization does not occur. As the latter formula may be written 
[1—(1/2N)- - - ]/2N the difference is ordinarily negligible (WRIGHT 1931). 

The inbreeding coefficient of individuals in such a population can be cal- 
culated from its definition as the correlation between uniting gametes. Let Fx 
be the correlation between random gametes drawn from a population of size 
xN and use primes tc indicate preceding generations as in the text figure (p. 117). 
The inbreeding coefficient itself would be F; in this terminology. The values 
of these coefficients can be expressed in terms of coefficients for preceding 
generations by tracing all connecting paths and noting that the path coeffh- 
cient .b, relating gamete to parental zygote, has the value /(1+F’)/2 and 
that the path coefficient, a, relating offspring zygote to one of the gametes 
that produced it, has the value \/1/[2(1+F)]. The compound coefficient 
ba’=} (WRIGHT 1921). It may easily be seen that (8) can be deduced at once 
from these considerations. 

In the case of continuity 


























- i ted. hte? -(=*) +5 
= —_ = 
<7 n° 

1 f/1+F” 2N -—1 
18) 4 F,)’ = — — }+ F,!’ 
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(19) Thus 








I1+F’ N—12(1 /14+F’ 
rE ete) 
2N N 2N 2 
+ I (7*)+ } 
2N 3N 2 ; 


If the same population structure has continued indefinitely, primes may 
be dropped. 


I1+F 1/N-1 1/N-1 2N-1 
at al le ow 
2N 2 N 3 N 2N 
1 /N — 1\/2N — 1\/3N —1 
OIE] 
4 N 2N 3N 
This is an infinite series, but in practice the value of F that is of interest is 
that relative to some finite population. The correlation between random 
gametes in a population of size KN is Fx which may be taken as zero, thereby 


stopping the series at (K—1) terms. Let tx be the xth term in the series in 
brackets and )|¥-'t the sum of first (K—1) such terms 
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(21) F=5t/ [x-<et] 
(x-—1)N-1 


(22) t. ————— € (x1). 
xN 


Let tx—0.5) = (tat x—1)/2 and At x—0.5) = tx—tex-1 
At ¢x-0.5) 2(N + 1) 


2 = — ‘ 
- t (x—0.5) N(2x — 1) —1 








If the values of t are treated as ordinates of a curve with abscissas x, we 


may write t and x in place of t;x-0.5) and (x—o.5), respectively. The following 
then hold approximately 


dt 2(N + 1) 
(4) = -— 
tdx 2Nx — 1 
I —(N+1)/N 
2 te Ciz-— 
(25) (x ~) 


K2-1 K2—-0.5 
(26) Zt f tdx approximately 
x 1 


3 I I —1/N I I —1/N 
a Qeaiet -2)"- st 
? K , 2 2N : 2 2N 


1 








The value of the constant C can be obtained by equating actual and esti- 
mated values of t. Estimates for all but the first few terms in the series are in 
close agréement. Thus if N=10 


Actual series [r+.45+.285+.206625+ ---] 
Estimated series C[1.05805+.47969+.30423+.22067+ +--+] 


The estimated value of C from the first term is .9451, from the second term 
-9381, from the third term .9363. The limiting value is .935774. The value of 
C approaches 1 as N increases. Thus for N= 100, C=.994157. 

Estimates of >. ¥~"t directly from (27) are not good approximations, but most 
of the error is in the first few terms. Good estimates can be made by using the 


actual values from (22) for these terms and the estimates from (27) for the 
later terms. For N=10 








Actual (22) Estimate (27) Error of Estimate 

3 

Dt 1.73500 1.86782 + .13282 

1 

9 
p> t - 79002 - 79250 + .00248 

39 
Dt -9Q5I1 -99541 + .00030 


dt .57228 .57228 + .00000 
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A priori, one would expect F to approach 1 as a limit as the size of population 
is increased without limit. This requires that }_;*t approach N. Trial for values 
of N from 10 to 10,000 indicates that this is actually the case and thus gives a 
good check on the theory. Following are examples: 











N=1I10 N=20 N=50 N= 100 
39 9 ED 
> t from (22) 3-52013 3.86519 4.09266 >t 2.797 
1 1 
>> t from (27) 6.47987 16.13481 45 -90734 Po t 97.203 
40 aw’ 10 

10.CCcO0O 20.00CCO 50.000c0 ICO.000 


LOCAL INBREEDING ALONG A LINEAR RANGE 


In a species with an essentially one dimensional range (parents drawn from 
the whole width) the extent along the range from which the ancestors of 
generation K are drawn is proportional to \/K as with area continuity, but 
the effective size of the corresponding population is \/K N instead of KN. 
By analogous reasoning 


(28) F=))t/(2N- )'t) 


where 


Ee=[ ‘ I (“—)+ I Gy“) | 
me ty V2 N V3 N V2N 


NV(x — 1) -1 

















(29) = — tix 
NvVx 1) 
teal At (x-0.5) 2N(Vx — 1 — Vx) — 2 
30 —— —— x : 
t (x-0.5) N(x — 1+ Vx) -1 


Treating this expression as the slope at the mid-interval and replacing 
(x—o.5) by x 


dt 2N(V/x—0.5—Vx+0.5) —2 





tdx N(V/x—0.5+Vx+0.5) —1 
Ss N[r + 1/(32x2) + --- ] + 2/x 

2Nx[1 — 1/(32x?) +--+ ] — Vx 
Ignoring 1/(32x*) and smaller terms in the brackets, this yields 


(32) t = Ce-¥*/N[y/x — (1/2N) J-h +04, 


(31) 





This seems to be as accurate an approximation as is warranted after replace- 
ment of At/t by dt/tdx. 

Comparisons of actual and calculated values of t indicate that estimates of 
C approach stability after a few terms. For N=10, C=1.1529 (from 30th to 
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4oth terms). For N=100, C=1.01465 (from gth and roth terms). For larger 
values of N, especially if x is 10 or more, it may be sufficiently accurate to take 
dt/tdx as —(N+2/x)/2Nx, C=1 


(33) t = e-*“x/N/,/x approximately. 
In this case 


Ke-1 


(34) >» t= N(e-2¥Ku/N = e~2¥K2/N) 
Ki 


The value of >-¥~'t can be approximated by finding actual }-%t from (29), 
estimating )-*,_' from (34) and multiplying the latter by the mean ratio of t 
from (32) to that from (33). Calculation of }>?t, N=10, by this method (by 
steps) gave 10.008 (instead of theoretical 10) and for N=100 gave 100.07 
instead of theoretical 100. These theoretical values are on the assumption that 
the limiting value of F is 1 which again is seen to be verified. 


CORRELATION BETWEEN ADJACENT INDIVIDUALS UNDER 
UNIPARENTAL REPRODUCTION 


The effect of isolation by distance on the frequencies of two alternative types 
in a population with exclusive uniparental reproduction can be treated simi- 
larly, again assuming that there are no complications from other factors. The 
treatment, however, cannot be in terms of the inbreeding coefficient. Let E 
be the correlation between adjacent individuals, and assume that there is 
short range dispersion in each generation such that individuals are derived 
from a parental group of effective size N. With area continuity, the ancestors 
of the Kth generation are drawn from a population of effective size KN. The 
correlation between adjacent individuals can be analyzed into two compo- 
nents, that due to the chance, 1/N, of derivation from the same parent and 
that due to the chance (N—1)/N, of derivation from different individuals of 
the group, the correlation between which may be represented by EF,’ in analogy 
with F,’ in the case of biparental reproduction. This in turn can be analyzed 
into a component due to the chance 1/2N of derivation from the same in- 
dividual of the second preceding generation and that due to the chance 
(2N—1)/2N of derivation from different individuals of this group, the cor- 
relation between which may be represented by E;3”. 





E I + N- 1 E, 
an ei 
(35) | or a2 ee 
2N 2N 
v7 I 3N sak. var 
Bel? ee mam fe Saree 9" te, 
{ 3N 3N 











122 SEWALL WRIGHT 























1,00 
N10 
$0 
G we 
v_60 
49;0-¢,) N,>50 
, Ny? 10" 
Ny: 10? 
‘ asian Ne: 10° 
Ni/Nuzk; | 10 10° «10 10" gta? 
R: /Ry 1 10 10” 10° io” 
1.00 
N;= 10" go 
Oz: 
= a" 
491-4.) 




















Ne /Ns=K, [te | 0 1 1 10° Jo" joe 10’ 10 
Re/R; 1 10 10 10 























Ni 2 Ny nT ny nT TT 


Figures 1 to 3. Variability of gene frequencies of local populations within a continuously inhabited 
area that extends indefinitely in all directions. It is assumed that there is no appreciable long range 
dispersal or mutation. Each curve applies to a particular size (Nu) of random breeding unit and 
thus to a certain amount of short range dispersal. Variability is measured by ox/+/qy(1—qy) 
where qx represents the gene frequencies of the subgroup in question and qy that of the compre- 
hensive population. 

Ficure 1 (top).—The variability of gene frequencies (qu) of the random breeding units them- 
selves, within areas up to 10‘ times their radius (Rj/Ru) or 10° times their population size (Ni/Nu). 
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Again we may drop primes if the same population structure has continued 
for a large number of generations. 


oo Pe CSCS) 
CSE] 


(37) E=)t/N. 


The series }_t is the same as encountered in the case of biparental reproduc- 
tion, but the formula for E differs from that for F. It resembles it in approach- 
ing 1 asa limit, as is to be expected a priori, but for a given N, E is about twice 
as great as F for small values of >-t, and the difference from the limit is only 
about half as great if >> is close to 1. These relations are illustrated in figures 
7 and 1 dealing with uniparental and biparental reproduction, respectively. 

In the case of linear continuity and derivation of individuals from a parental 
population of N, the effective size of the population of the Kth ancestral gen- 
eration is \/K N, again as under biparental reproduction. By analogous reason- 
ing E=)_t/N where > _t is the same series as in the biparental case. The relation 
of E to F for the same N is similar to that described above in the case of area 
continuity. 








(36) 





RANDOM DIFFERENTIATION OF PANMI€TIC UNITS IN A CONTINUUM 


Returning to biparental reproduction, the situation in a random breeding 
unit imbedded in a continuous population of defined size may be compared in 
some respects with that in an “island” whose population is replaced to such 
an extent in each generation by migrants representative of the whole that the 
inbreeding coefficient of individuals is the same. There is the important differ- 
ence that adjacent groups should be closely similar in the former but uncor- 
related in the latter. Nevertheless the amount of differentiation among groups 
taken at random from the whole should be the same in both cases since equa- 
tions (9) to (17) apply in both. It is most convenient to use \/ F = o4/Vqi(1 — qu) 
(from (13)) to measure this differentiation. It should be noted that this ex- 
cludes the variability due to the immediate effect of sampling. 

The theoretical variabilities of random breeding units of various sizes (10 
to 10,000) within populations up to 10° times the size of the units (or 10‘ times 
the radius), continuous in all directions, are compared in figure 1. In interpret- 
ing this variability, it may be noted that if q.=}, a value of \/F (ordinate) 





K; is the average number of generations of separate ancestry of random individuals of the popula- 
tion Ni. 

FicurE 2 (middle).—The variability of gene frequencies (qj) of populations of a given size, 
N\=10‘, within areas up to 10‘ times their radius (Ri/Ri) or 10% times their population size 
(N:/Ni). Note the similarity to Figure 1. 

FicurE 3 (bottom).—The variability of gene frequencies (qi) of populations of any size,-Ni, 
within a region with a population of a given size, Ny=10°. 
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greater than .577 means a U-shaped distribution of gene frequencies and thus 
very great differentiation. The situation is similar to that found where Nm 
is less than o.5 in the island model. There is important differentiation down 
to at least \/F =.22 (equivalent to Nm=s). There is only slight differentiation 
if \/F is less than .07 (equivalent to Nm= 50) (cf. fig. 1, WRIGHT 1940). 

It is apparent from figure 1 (this paper) that there is a great deal of local 
differentiation if the random breeding unit is as small as 10, even within a 
territory the diameter of which is only ten times that of the unit. If the unit 
has an effective size of 100, differentiation becomes important only at much 
greater relative distances. If the effective size is 1000, there is only slight 
differentiation at enormous distances. If it is as large as 10,000 the situation 
is substantially the same as if there were panmixia throughout any conceivable 
range. 

The situation is very different as may be seen from figure 4 in a species whose 
range is essentially one dimensional (for example, a shore line). Different 
alleles may approach fixation in different parts of a range only 100 times the 
length of the random breeding unit if the effective size of the latter is less than 
100. The range must be about 1000 times the length of the unit if the latter 
has a size of 1000 and about 10,000 times its length if the size of the unit is 
10,000 to give this result. This difference between area and linear continuity 
has been suggested on a priori grounds by THOMPSON (1931) in connection 
with a study of the correlation between water distance and amount of differ- 
entiation within species of fish. 


RANDOM DIFFERENTIATION IN A HIERARCHY OF SUBDIVISIONS 


The attempt to apply these conclusions to actual cases is hampered by the 
difficulty of determining what are the random breeding units and their effective 
sizes. To obviate this, we should find how groups of any arbitrary size vary 
within a more comprehensive population. 

Consider a total population, size Nt, subdivided into H groups of intermediate 
size N; and these in turn subdivided into K random breeding groups of size 
Nu. The inbreeding coefficient of individuals is zero relative to the unit groups, 
F; relative to the intermediate groups and F; relative to the total. Both H and 
K, in contrast with Nu, will be treated as large numbers. 

The variance of the gene frequency (qu) of unit groups within the inter- 
mediate groups is given by (17) using the proper subscripts. The average 
value of this variance will be represented by o%.;. The variance of the mean 
gene frequencies of the intermediate groups in the total will be represented by 
oj. and that of qu in the total by 03... 


‘ H 
(38) from (17) Ou-i= >, [qi(t—qi)][(2Nu—1)Fit1]/2HNu 
(39) oui [qe(t— qe) —o1-«] [(2Nu— 1) Fi+1]/2Nu 
(40) from (17) oat =Qe(t— qe) [(2Nu—1) Fi+1]/2Nu 


2 2 
(41) Out =O2.i+Gi- 
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Figures 4 to 6. Similar to figures 1 to 3, respectively, except that a linear range (such as a shore 
line) is postulated. 

Ficure 4 (top).—The variability of gene frequencies (qu) of the random breeding units them- 
selves, within ranges up to 10‘ times their length (Li/Lu) or population (Ni/Ny). 

FicureE 5 (middle).—The variability of gene frequencies (qi) of populations of a given size, 
N;= 10‘, within ranges up to ro‘ times their length (L;/Lj) or population (N:/Nj). Note the dis- 
similarity to figure 4 in contrast with the similarity of figures 1 and 2. 

FicureE 6 (bottom).—The variability of gene frequencies (qi) of populations of any-size (Ni) 
within a range with a population of a given size, Ny= 10°. 
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e. t= [q(t — qu) — 03-4] [(2Na— 1)F\+1]/2Nu 
St: 

(43) Equating (40) and (42) oi.=qe(t—q.) [Fr—Fi]/[1—Fi]. 


(42) from (39) and (41) 


This demonstration involves the assumption that there is inbreeding rela- 
tive to the intermediate groups because these are subdivided. It may be noted 
that the same value of o{., may be derived as follows without this assumption. 
(44) From (9) ye = 2qe(1 — qe)(r — Fi). 


But y; is also the average heterozygosis of the intermediate groups 
H 


> [eqi(t — qi)(t — Fi) J/H 


coor (o-(£u)a 


H H 
(46) Hin = =. (qi = q:)?/H = ( > at)/H = .*. 


(47) From (45) and (46) ye = 2[1 — Fi] [qe(x — qd) — oie]. 
(48) From (44) and (47) oi-+ = qe(t — qu (Fe — Fil/[1 — Fil. 


yt 


(45) 


a 


In neither demonstration is there any assumption as to the geographic dis- 
tribution of the values of the mean gene frequencies, qi, within the total. They 
may be distributed at random as implied in the island model or there may be 
gradients as expected with continuity. 

The quantity 

(Fi. Fi) Oi-t 

(r—Fi) Vat — a) 
may be used as an index of the amount of differentiation among populations of 
any size N; within a more comprehensive population (N;). The variabilities 
of populations of effective size Nj = 10,000 are considered in figure 2 (area con- 
tinuity) and figure 5 (linear continuity). In the case of area continuity the 
curves are somewhat similar to those shown in figure 1 for unit groups. It 
appears that populations of 10,000 (or any other size) exhibit about the same 
amount of differentiation within a whole whose population is a certain multiple 
of their own as the unit groups exhibit in a population that is the same multiple 
of their size. Whatever the size of the subpopulations considered the varia- 
bility depends on the size of the inbreeding unit. There is an important amount 
of differentiation among large regions if the unit group is as small as 10, appre- 
ciable differentiation if the unit group is as large as 100 but little if it is as large 
as 1000. It should be said that there are important qualifications if there are 


other factors (mutation, rare long range dispersal or selection) which will be 
considered later. 
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The situation differs considerably in the case of linear continuity. Groups of 
size Nj= 10,000 approach the limiting amount of differentiation within popu- 
lations only three times their length of range if Nu= 100 or less. There must be 
virtually complete fixation of one allele or the other over long distances with 
only short regions of transition. If Ny= 1000 there is relatively little differ- 
entiation within 10-fold lengths (that are heterallelic at all) but an approach to 
100 percent differentiation in 100-fold lengths. Thus transition regions are of 
the order of 10 lengths. If N, = 10,000, the transition regions are of the order 
of 10 lengths, and such groups approach roo percent differentiation within 104 
lengths. 

The interpretation of figures 1, 2, 4, and 5 is somewhat complicated by the 
fact that these do not measure variability on a constant scale. The denomi- 
nators of the ordinates (namely, ~/qi(1—qt) in 2 and 5) increase with the 
abscissas. The tendency toward fixation of large populations means that at 
the lower abscissas the average value of q; must be close to o or 1, making 
Vqi(1—q:) small. The structure of a population is exhibited in perhaps the 
most easily interpreted form by considering a constant comprehensive pop- 
ulation N; and showing how much differentiation there is among subdivi- 
sions of all sizes from the random breeding units up to major subdivisions 
(q,/V qt(1 —qz) plotted against Ni). Here the denominator is constant so that 
variability is always on the same scale. 

Figure 3 shows that with area continuity, the amount of differentiation falls 
off slowly with the size of the subdivision considered. If Nu=10 and N; is 10° 
(or any other size in the absence of other factors) there is marked differentia- 
tion among populations that are 1o percent of the total, although much less 
than among subdivisions of smaller sizes. If Ny= 100, there is only moderate 
differentiation among the smaller subdivisions and very Jittle among ones that 
are as large as 10 per cent of the total. In the case of linear continuity (fig. 6) 
there is virtually complete fixation of all subdivisions up to-10 percent of the 
total if Nu is 100 or less. If, however, Ny is 1000 there is a considerable propor- 
tion of these unit groups that are not fixed (o4,/qt(1 —qt) =-87). The differ- 
entiation among larger populations up to Ni=o.1 N; is not appreciably less 
than among the unit groups. If Ny= 10,000, o4,/Vqt(1—q¢) is only .10, but 
this index is practically as great among larger populations up to ro percent of 
the total. Thus with linear continuity most of the differentiation is that among 
large subdivisions of the total (of the order of 10 percent of its size). With area 
continuity, differentiation is more uniformly distributed at all levels. 

Area and linear continuity as well as the island model are ideal cases. There 
may be all grades of intermediacy between area and linear continuity as ex- 
hibited in branching and reticular distributions. Even with rather complete 
area continuity there are almost certain to be variations in density of popula- 
tion. The ancestry of individuals in the centers of high density would spread 
out less rapidly than under the ideal theory with the consequence that there 
would in general be more differentiation among such centers than indicated, 
unless this is interfered with by other factors, which must now be considered. 
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COMPLICATING EFFECTS OF MUTATION AND LONG RANGE DISPERSAL 


The foregoing theory indicates the possibility of an approach to fixation of 
different alleles in large areas of the same continuous population without the 
help of any differential action of selection. It is obvious, however, that this 
very slow process would be greatly affected by other factors that change gene 
frequency. The very fact of persistence of more than one allele over a long 
period of time tends to indicate that such factors are present in some sort of 
balance. Thus there may be reversible mutation, selection opposed by muta- 
tion or selection against both of two homozygotes. Moreover, the short range 
means of dispersal] that have been postulated are likely to be supplemented by 
occasional long range dispersal. All of these tend to prevent fixation of one 
type even locally. On the other hand, selection may favor one allele in some 
places and others in other places. This would tend to increase local differentia- 
tion. It is necessary to consider how such processes affect the situation. 

It will be well to review first the joint effects of recurrent mutation and long 
range dispersal in the case of the island model (WRIGHT 1931). The rate of 
change of gene frequency under recurrent reversible mutation varies linearly 
with the gene frequency: Aq=v(1—q) —uq= —(u+v)(q—4) where v is the 
mutation rate to the allele in question, u is the rate of mutation from it and 
4(=v/(u+v)) is the value of q at equilibrium, which is the same in this case 
as G the mean value of q. This is similar in form to the expression for the effects 
of long range dispersal: Aq= —m(q—q). 

If both processes are occurring, the expressions merely need to be added: 


(49) Aq = v(t — q) — uq — m(q — qe) = — (m+ u+ v)(q — Q) 
where 
q = 4 = (mq + v)/(m+ u + v) 


for a local population in which @ is not necessarily the same as gene frequency 
for the whole species (qt), since other factors may be at work in other localities. 
The long time distribution for such a population is approximately 


(50) $(q) = Ca @mattv)—1(7 — q) 8 im C-at)tul—1 
G(x — 4)/[4N(m + u + v) + 1]. 


If conditions are the same in all islands, G=q.=v/(u+v) and the variance 
qe(1— qt) /[4N(m+u+v)+1] is not only the long time variance for a single 
island but also the variance of q, at any time, among the islands. 

The variance of subpopulations (inbreeding coefficient Fj) in a total rela- 
tive to which the inbreeding coefficient is F, has been given (43, 48) as 
qe(t1— qt) [F:—Fi]/[1— Fi] applicable to any case, including both the island 
model and that of a continuous population with only short range dispersal. 
The effective value of the immigration index in the latter may be obtained by 
equating with the expression for o,? given in (51). 


(52) m = [1 — F,]/[4N(F: — Fi)]. 


(51) O° 
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At first sight it might appear that the rate of change of gene frequency in 
cases in which there is long range dispersal and reversible mutation (joint 
coefficient m,) in addition to predominant short range dispersal (coefficient 
m2 from (52)) might be obtained by simply adding the contributions from 
these sources as calculated from their effects by themselves. This, however, 
overlooks the likelihood of an important interaction effect. It is necessary to 
go back to the formula for the correlation between uniting gametes (18) and 
determine how it is affected by mutation and long range dispersal. 

Assume that the proportion m, of the gametes represent a random. sample 
from the whole species. The identity of the theories of long range dispersal 
and mutation make it possible to let m, here represent (m+u+v) of preceding 
formulae. Cases in which one or both of the uniting gametes are included in 
this proportion make no contribution to the correlation between uniting 
gametes. The proportion which makes a contribution is (1—m,)*. Equations 
(18) are accordingly to be modified as follows: 


[rao mf) oC 


F,’ = (1 - m)'{ ‘bd )+ xl F”| etc. 
2N 2 2N 





(53) 





Again primes may be dropped, if the same situation has held for a long time. 


[- + =|[o aes (1 - =a ts ~ ‘) 
2N 2 N 











™ (1 — m)*/N — 1\/2N -—1 
ere a Fe ee 
3 N 2N 
(55) t = C(1 — m)*[x — (1/2N)]-S+»N 
Kr-1 K2—0.5 
(56) )} t=C (x — m,)?*[x — (1/2N)]-8+”/Ndx approximately. 
Ki K)—0.5 


This is a less convenient expression than obtained where m;=o, but approxi- 
mate values can be obtained by taking values of K at short enough intervals, 
finding 


K2-0.5 K2-0.5 
| f (1 - m,)*dx| [c [x — (1/2n) ax 
K,—0.5 K,—0.5 


and correcting according to the percentage error where both factors are of the 
form UeE*dx] with the K’s chosen so as to give the same ratios of terminal 
ordinates. 


(57) r= <t/[- re]. 


1 






































130 SEWALL WRIGHT 
1.00 
N, 210 
80 Nu 
« 
te NY=5: 
£9.('-9)) 
no N,=100 
20 
0 2 > + s 6 7 s 
Ni/ Ny: K; | 10 0 10 10 10 0 10 10 
R:/Ry i 10 10° 10 10 
100 
N y= lo 80 
oO; 
agieillll 
49.0-%) 
40 
20 
N; / Ny: Kj 
R; /R, \ 10 10" 10” 10° 
1.00 
N, = 10 
? m: 0 
N, = 10 80 - 
OF; 
— 
¥7,(I- 9) g* 
40 9 
-20 io” 
0 2 3 4 6 7 
Ni: KN, 10) «610° «10° «10% 1 (10 1” ta? 


FiGureE 7 (top).—Similar to figure 1 except that exclusive uniparental reproduction is assumed. 
Nu is the population size of the group from which the parents of adjacent individuals are drawn 
at random and thus measures the extent of dispersal. The curves show the variability of gene 
frequencies (qu) of such unit groups within areas up to 10‘ times their radius (R;/Ru) or 108 times 
their population (Ni/Nu). Each curve applies to a particular extent of short range dispersal. 

Ficure 8 (middle).—The effect of occasional long range dispersal or mutation (rates up to 
m= 107%) on the variability of gene frequency of random breeding units of size Ny= 10 within areas 
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Figure 8 shows how oq,/\/qi(1—qi)= V Fi, for parental populations of size 
Nu= 10, rises with the size of the population (Nj=KN,) in the presence of 
random replacement (m) in the proportions 10-7 to 10-*. The variability of 
the unit population is substantially the same as if there were no mutation or 
long range dispersal if Nj is less than 1/m,, but rather abruptly approaches 
a limit in larger populations. Instead of approaching 1 as when m,=o, VF; 
approaches 0.81 if m;=10~", 0.76 if m;=10-*, 0.70 if m;=10-5, 0.63 if my 
=10*, 0.55 if m1=10-%, and 0.44 if mi1=107°. 

Figure 9 shows how the variability of subpopulations of any size N; within 
a total population of size 10° is affected by the value of m,. It is again assumed 
that short range dispersal is such as to give Nu=10. There is very little differ- 
entiation in this case of subpopulations larger than 30/m. It is clear that it 
requires only a small amount of long range dispersal or mutation to prevent 
the differentiation of large populations. 

The amount of differentiation of populations, that are a given multiple (Kj) 
of the unit population, falls off rapidly with increase of Ny. But the multiple 
beyond which differentiation virtually ceases is largely controlled by the factor 
(1—m,)? and is thus nearly the same for all values of Ny under which there is 
any appreciable differentiation at any level. The value of (1 —my,)** is reduced 
to approximately 10 per cent of its value each log.10o/2m, generations (assum- 
ing m, to be small). 

Among populations of a given absolute size (Nj) there is, therefore, a certain 
range of dispersal (determining N,) that is most favorable to differentiation 
in a continuous population, On the one hand, if the range of dispersal is such 
that N, is larger than 1000, there is very little differentiation, but on the other 
hand, if N, is so small that there are more than 3/m,; random breeding units 
in the population under consideration, there is also virtually no differentiation. 

Linear continuity may be treated similarly, by multiplying the terms of 
(32) by (1—m,)*™*. 

Under exclusive uniparental reproduction, the chance that an individual is 
derived from the parental population without mutation is (1—m,), instead 
of (1—m,)*. Each term in the series >t is accordingly to be multiplied by 
(1—m))*. 

The formula for the distribution of gene frequencies among subpopulations 
of a given size, Ni, in the total Ny, may be written approximately as follows: 


(58) $(q) = Cqlt-Fe)/ Pe-F) lar-1(7 a q) (Fe) / Pe- Fj) 1 -ae)—1, 


Here the F’s incorporate the effects of mutation and long range dispersal as 





up to ro‘ times their radius or 10® times their population size. The highest curve (m=o) is the 
same as the highest curve in figure r. 

FIGURE 9 (bottom).—The effect of occasional long range dispersal or mutation (rates up to 
10~*) on the variability of gene frequencies of populations of any size, Nj, within a region with 
a population of a given size, Ny= 10°. The random breeding unit is assumed to be Nu=1o0. The 
highest curve (m=o) is the same as that in figure 3. 











132 SEWALL WRIGHT 


well as of short range dispersal. This distribution has the mean q; and the 
variance q:(1—q:)[F:—Fi]/[1—F] derived above. It differs considerably from 
the distribution 


(59) $(q) _ Cath (artma)ae- 1(y _— qi teeta get 


if mz is the estimate of effective m from (52) based on the value of Fj and F;, 
under short range dispersal in the absence of other factors. It is legitimate, 
however, if m; is known, to write ¢(q) in the form of (59) with the under- 
standing that mz measures the effect of short range dispersal in the presence 
of the other factors measured by m, with full allowance for the interaction 
effect. Indeed this seems to be the only practicable method to use in analyzing 
data from actual populations in view of the fact that no ideal model such as 
area or linear continuity is likely to be exactly realized. 


THE EFFECTIVE SIZE OF INBRED POPULATIONS 


The effect of inbreeding on the effective size of populations is a matter that 
requires some consideration. Size of population enters into the formulae for 
the distribution of gene frequencies principally through the sampling variance 
which is q(1—q)/2N in a random breeding diploid population. Assume that 
individuals have an inbreeding coefficient F; relative to an island population. 
It makes a difference in the sampling variance whether this is due to mating 
of relatives, not resulting in any territorial subdivision, or whether it is due 
to partial isolation of subdivisions that breed at random within themselves. 
In the former case, the increased frequency of homozygotes causes an in- 
creased sampling variance of the whole island. If there were nothing but homo- 
zygotes, (qiAA+(1—q;)aa), as under long continued self-fertilization, the 
sampling variance would be qi(1—qi)/Ni, twice that under random mating. 
With random bred and inbred components in the array of equations (g) in the 
proportions (1—F;) to Fj, the sampling variance would be the weighted aver- 
age. 


o" tq, = (1 — Fi)qi(t — qi)/2Ni + Fiqi(t — qi)/Ni 
= qi(t — qi)(1 + Fi)/2Ni. 


(60) 


If on the other hand, the island population is subdivided into partially iso- 
lated groups that breed at random within themselves and if each group tends 
to maintain its numbers (that is, there is no intergroup selection) the sampling 
variance of the total island population is Jess than if there were random mating 
throughout. In each subgroup, the sampling variance is qu’(1— qu’)/2Nau, aver- 
age oq, => ¥ qe’ (1— qu’)/2NuK. The sampling variance for the mean gene fre- 
quency of the island would be 054;=07%q,,/ K=) fqu’(1 —qu’)/2NiK if Ni 
= KN,. But from(10) yi= 2 Kq/ (1—qu )/K. Thus 0734; = yi/4Ni. From (9) yi 
= 2qi(1—qi) (1—F*) giving 


(61) ta, = qi(1 = qi)(1 - F;)/2Ni. 
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The situation in an arbitrarily delimited region in a continuum resembles 
the second. Effective N in such a formula as (59) is thus KN./(1—F)). 


COMPLICATING EFFECTS OF SELECTION 


Consider next the complications introduced by selection. The effects of vari- 
ous kinds of selection on gene frequency (contributions to Aq) and the form 
taken by ¢(q) on substitution in (1) have been discussed in previous papers 
(WRIGHT 1931, 1942). These are applicable directly to the island model. The 
case of arbitrarily delimited portions of a continuum can be treated in the 
same way, but if so, m2 of formula (59) includes the interaction effect of 
selection as well as of mutation (and of long range dispersal if this can be dis- 
tinguished from the short range dispersal). The index mz is to be interpreted 
as the effective amount of replacement of the subpopulations in question by 
representatives of the species as a whole under the conditions of mutation and 
selection that actually hold. As noted in connection with the complications in- 
troduced by mutation and long range dispersal, this seems to be the most 
practicable method of dealing with concrete data. It is important, however, to 
determine the theoretical relations between the values of m among subdivi- 
sions of different sizes under various ideal population structures. 

For such theoretical consideration of the interaction of selection with the 
effects of short range dispersal, it is necessary to return to the derivation of F 
by path coefficients (18) in analogy with the treatment of the complications 
due to mutation and long range dispersal (53). But in attempting to carry out 
the analogy we encounter a serious difficulty. 

Long range dispersal (by definition) and mutation may be treated as intro- 
ducing a random admixture into the local population in constant proportion 
m. Selection may also be treated as introducing a certain random admixture, 
but it is not in constant proportion. The amount of such admixture in the 
case of mutation and long range dispersal may be represented as 


(62) [— Aq/(q — 4)] = (m+u+v) =m™. 


This formula may be applied where Aq also involves selection pressure. Con- 
sider the case of a balance between opposing pressures of mutation and selec- 
tion in the simplest case, that of no dominance, and assume that the same 
situation holds throughout the species. 


(63) Aq 
(64) m 


v(1 — q) — sq(t — q) = — s(t — q)(q— q) where q = v/s 
[— Aq/(q — 4] = s(x — q). 


The joint effect of mutation and selection in this case is equivalent to im- 
migration of a random sample, but to an extent that is a function of the local 
gene frequency. A rough idea of the effect may be obtained by substituting 4 
for q and treating m1=s(1—q)=s—v as a constant. If s is much larger than 
v we may indeed simply take m,=s and use (1—s)? in place of (1—m),)? in the 
theory developed for mutation and long range dispersal. Inspection of figures 8 
and 9 shows how selection of this sort interferes with the differentiation that 
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would occur within the continuous population under the specified conditions 
if there were no complication of this sort. 

As another example consider the case of selection against both of two homo- 
zygotes. Representing the relative selective values of AA, Aa and aa by 
I—Saa, I and 1—Saq respectively 


(65) Aq == (SAA + Saa)q(1 sia q)(q i q) 
where G = Sas/(Saa + Saa) 
(66) m; = (saa + Saa)q(I — q)- 


While selection does nothing to local populations that have become fixed 
and the equivalent immigration index m, is accordingly o if q is either o or 1, 
the average value may well be such as to severely restrict differentiation of 
even rather small subdivisions of a continuous population. Again a rough idea 
of the effect may be obtained by substituting q for q. It should be noted that 
if there are numerous alleles and selection for heterosis is general, selection 
tends to increase differentiation. 

In a recent paper (WRIGHT, DoBZHANSKY and HovANITz 1942) an attempt 
was made to interpret the frequencies of lethals in a continuous population of 
Drosophila pseudoobscura on Mt. San Jacinto. The following formula was ar- 
rived at for the rate of change of the frequency of a typical lethal gene. 


(67) Aq = ¥(1 — q) — m(q — Gq) — q(8 + F) — q(t — 38 — 2F) 


where v is the mean mutation rate per generation, s the mean selective dis- 
advantage of heterozygotes, G the mean gene frequency, F the inbreeding 
coefficient, and m the effective immigration coefficient of the territory under 
consideration. It was shown that approximately the same variance of gene fre- 
quencies was reached by replacing the above expression by one in which the 
component of Aq, measuring the tendency toward increase of gene frequency— 
namely, (¥-+mq) (1—q), is balanced by the linear expression that gives the 
same mean as the correct expression namely, —(¥+mq) (1—4)q/G 


(68) Aq 


Il 


— (m + ¥/q)(q — 4) approximately 
(69) m, = (m + 0/q) approximately. 


DIFFERENTIATION OF SUBDIVISIONS BY SELECTION 


If selection acts differently in different regions, it is obvious that none of 
the formulae given here apply to the distribution of values q among these re- 
gions, but only to the long term distribution within single ones. As a basis for 
discussion consider the following simple case, which refers to rate of change of 
gene frequencies in an island as affected by the local conditions of selection 
measured by s (assuming no dominance) and the amount of immigration meas- 
ured by m (WRIGHT 1931, 1940). 


(70) Aq = sq(1 — q) — m(q — q). 
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In a local population in which s (whether plus or minus) is smaller in abso- 
lute value than m, gene frequency can depart only slightly from the average 
of the species (4=q:+(s/m)q:(1—qt)) approximately. Crossbreeding here 
swamps the tendency toward selective differentiation. On the other hand, 
local gene frequency tends to be dominated by the Jocal conditions of selection 
in populations in which s is larger than m in absolute value = 1—(m/s)(1—qt) 
or q=(—m/s)qt approximately, depending on whether s is positive or nega- 
tive. 

The effectiveness of selection here is not related directly to the size of the 
island population. However, there is likely to be indirect relationship. This 
may be illustrated by considering three situations. 

First, consider islands with various populations but the same absolute 
amount of immigration (as might well be the case if the areas are the same but 
population densities differ). Among such islands, Nm is constant. All have the 
same amount of nonadaptive differentiation (measured by 1/(4Nm-+1) but a 
given selection pressure is more effective on the islands with larger population 
(and hence smaller m) than among those with smaller populations. 

A second situation is that in which size of population is proportional to 
area and the number of immigrants is proportional to the extent of boundary 
(Nm«4/N). Here there is more nonadaptive differentiation on the smaller 
islands and more adaptive differentiation of the larger ones, although the latter 
effect is less marked than in the preceding case. 

Finally, if both size of population and amount of immigration are propor- 
tional to the area (m constant), there is markedly more nonadaptive differ- 
entiation on the smaller islands but no relationship between adaptive differ- 
entiation and size of population. 

Summing up, any sort of differentiation is favored by small m, but the large 
populations tend on the whole to exhibit predominant adaptive differentiation, 
while the smaller ones exhibit predominantly nonadaptive differentiation. 

The situation in a continuous population is similar in that nonadaptive 
differentiation should be most conspicuous locally and adaptive differentiation 
among larger subdivisions. The most significant thing, however, given a cer- 
tain amount of differential action of selection, is the size of the random breed- 
ing unit. If this is large—for example, over 1000, very little nonadaptive 
differentiation is to be expected and only rather strong differences in the action 
of selection avoid swamping. If on the other hand, there is only short range 
dispersal—for example, Ny= 10, large regions tend to become adaptively dif- 
ferentiated under the influence of slight differences in selection, and superim- 
posed on this should be a large amount of nonadaptive differentiation of small 
regions. The maximum amount of nonadaptive differentiation among popula- 
tions of a given size however, is not found with the smallest Ny, but at a certain 
optimum value. 

If a population spreads over a large territory in which the environmental 
conditions are substantially uniform, there would primarily be only nonadap- 
tive differentiation, the amount depending on the value of m or of N, depend- 
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ing on the model that is most appropriate. With such differentiation occurring 
simultaneously but more or less independently in all series of alleles, each local- 
ity would have a slightly different genetic system from every other locality. 
These systems may be expected to differ in their success in meeting the en- 
vironmental conditions. Among those which are relatively successful, adapta- 
tion is likely to have a slightly different basis in each case. The populations 
with such systems tend to become denser and to send out more than their share 
of migrants and thus enlarge in extent. Each would tend to perfect the line of 
adaptation on which it had started. Thus permanent differential action of 
selection would soon be brought into play in spite of the postulated uniformity 
of the conditions. 

The expansion of centers of population characterized by certain genetic sys- 
tems and contraction of those characterized by other systems is the process of 
intergroup selection referred to in the opening paragraph. The genetic system, 
including its state of heterogeneity as well as its central type, is the basis of 
selection instead of merely the net favorable or unfavorable effect of each single 
gene, which is the only basis for selection under panmixia; or the single geno- 
type, which is the most probable basis under self-fertilization or vegetative 
multiplication. The present analysis indicates that this most favorable basis for 
evolutionary advance of the species as a whole may be present under certain 
conditions in a continuous population as well as in one consisting of partially 
isolated groups. 

SUMMARY 

Formulae are derived relating the variance of the gene frequencies of sub- 
groups (02) to the effective population number of these (N), the effective 
proportion of replacement per generation by immigrants (m), the inbreeding 
coefficient of individuals relative to the total population (F), and the mean 
gene frequency in the latter (q:). Thus o2=qi(1—qt)/[2N—(2N—1)(1—m?)] 
=q:(1—q:)F/(1—m)* including the immediate sampling variance, but 
0, = qt(1—qz)F excluding this. 

The effect of isolation by distance in a continuous population in which there 
is only short range dispersal in each generation is worked out on the hypothesis 
that the parents of any individual may be treated as if they were taken at 
random from a group of a certain size (N). It is shown that the inbreeding co- 
efficient of individuals in such a population relative to a population of size KN 
can be expressed in the form F =). K~"t/[2N—)>_K~'t] where >t is the sum of a 
series of terms in which t;=1 and tx = tx—1)[(x— 1) N—1]/xN or approximately 
C[x— (1/2N)]~"N+”/N where C is a constant close to 1. The value of >t can 
be obtained sufficiently accurately by actual calculation of the first few terms, 
supplemented by the approximate formula 


S t 1 \c1/N I 1 \-1/N 
t = CN/{ K, —- — - — —{K,-——- — 
Ky I( 2 =) ( ; 2 =) | 


for later terms. The limiting value )_/t is N. Thus F approaches 1 in an indefi- 
nitely large continuous population. 
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The preceding results apply to area continuity. With continuity in a linear 
range (for example, shore line), F=) t/[2N—)_t] as above, ti=1 but tx 
= tix» [N/x— 1—1]/Nv/x or approximately Ce? VE/N(/x—(1/2N)]7 8 tO™, 

In a continuous population with exclusive uniparental reproduction, the 
correlation between adjacent individuals is of the form E=)_t/N where >_t 
is the same as above for area or for linear continuity as the case may be. 

The variance of gene frequencies in subdivisions of any size, Nj, within a 
more comprehensive population N; is given by the formula oF 1=Ge(1—G) 
[F,—Fi]/[1— Fi] where F; and F;, are the inbreeding coefficients relative to the 
populations of size Nj and Ni, respectively. 

It is shown that in the absence of disturbing factors, short range dispersal 
(N less than 1oo in the case of area continuity) leads to considerable differ- 
entiation not only among small subdivisions but also of large ones. Values of 
N greater than 10,000 give results substantially equivalent to panmixia 
throughout a range of any conceivable size. With linear continuity, there is 
enormously more differentiation than with area continuity. There is somewhat 
more differentiation under uniparental than under biparental reproduction. 

Recurrent mutation, long range dispersal and selection are factors that re- 
strict greatly the amount of random differentiation of large (but not small) 
subdivisions of a continuous population. A term (1—m,)** under biparental, 
(1—m,)* under uniparental, reproduction is introduced into the expressions 
for t referred to above. In this m:=[—Aq/(q—q)] where Aq is the rate of 
change of gene frequency (q) which such factors tend to bring about. 

The effective size of a population characterized by the inbreeding coefficient 
F depends on whether F is due to a tendency toward mating of relatives not 
associated with territorial subdivision, or to such subdivision. In the former 
case the sampling variance is oj,=q(1—q)(1+F)/2N, in the latter, q(1—q) 
(1—F)/2N, in contrast with q(1—q)/2N in a random bred population. 

If different regions are subject to different conditions of selection, the 
amounts of both adaptive and nonadaptive differentiation depend on the 
smallness of m (if subdivision into partially isolated “islands”) or of N, size of 
the random breeding unit (if a continuous distribution). If these are sufficiently 
large there is no appreciable differentiation of either sort; if sufficiently small 
there is predominantly adaptive differentiation of the larger subdivisions with 
predominantly nonadaptive differentiation of smaller subdivisions superim- 
posed on this. Even under uniform environmental conditions, random differ- 
entiation tends to create different adaptive trends in different regions and a 
process of intergroup selection, based on gene systems as wholes, that presents 
the most favorable conditions for adaptive advance of the species. 
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THE DISTRIBUTION OF LINANTHUS PARRYAE 


PLING and DoszHANnsky (1942) have recently published a very detailed 

account of the distribution of two alternative characteristics, blue and 
white flowers, of a diminutive annual plant, Linanthus Parryae, in a portion 
of its range in the Mojave desert. It seemed of interest to make an analysis of 
their data from the standpoint of the theory of isolation by distance discussed 
in the preceding paper (WRIGHT 1943). 

The region in question is about 80 miles long and averages about 10.5 
miles wide. It stretches in an east-west direction along the piedmont north of 
the San Gabriel and San Bernardino Mountains and is largely isolated from 
other populations of the species. Data were obtained at stations every half 
mile along the principal roads, including two or three parallel roads at most 
places. At each station if the plants were present, counts were made of four 
samples of 100 plants each, spaced at intervals of approximately 250 feet at 
right angles to the road. 

The vegetation in this piedmont belt is stated to be homogeneous. A number 
of species of shrubs are listed as characteristic. “The spacing of these shrubs is 
wide, and it is doubtful if they cover as much as 60 percent of the ground. 
Around the base of most of the bushes, the soil has accumulated so as to form 
a slight mound. Linanthus Parryae occupies the ground between the mounds, 
forming a widespread reticulum which is interrupted only by the stream beds 
or depositions alluded to above.” The authors find nothing to suggest any 
selective differential. 

Some of the conclusions reached by the authors are given as follows. “The 
apparent complexity of the distribution pattern of white and blue flower color 
in Linanthus Parryae can be reduced to a relatively simple scheme. The blue 
was found principally in three or four ‘variable areas.’ Outside these areas the 
blue was encountered sporadically, as would be expected if it were introduced 
there only on rare occasions through mutation or through occasional transport 
of ‘blue’ pollen or seed. Within the variable areas, the white and blue occurred 
side by side, and the population was differentiated into an extremely fine 
mosaic of microgeographic races. Pure white and pure blue colonies occurred 
at distances as small as 500 feet. Nevertheless, populations found one mile or 
iess apart, resemble each other more than do populations taken at random in 


* A portion of the cost of composing the mathematical formulae is borne by the Galton and 
Mendel Memorial Fund. 
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the variable areas.” This is followed by comparison with the U-shaped dis- 
tribution of gene frequencies which the present author has deduced as char- 
acteristic in effectively small populations. 


THE HIERARCHY OF SUBDIVISIONS 


For more detailed mathematical analysis, it is convenient to define a hier- 
archy of subdivisions. Six compact, approximately equal, primary subdivi- 
sions are recognized along the length of the range. Each of these includes five 
secondary subdivisions. Each of these in turn includes four tertiary subdivi- 
sions. The tertiary subdivisions were chosen so as to include three stations as 
far as possible (two stations in six cases, four stations in two cases). The sta- 
tions, as noted above, typically include four samples, but many of them con- 
tain less. 

















SMALLER PRIMARY SUBDIVISIONS (EAST TO WEST) bie 
SUBDIVISIONS | I II I IV Vv VI 
Secondary | 5 5 5 5 5 5 30 
Tertiary 20 20 20 20 20 20 120 
Stations 57 59 60 60 61 59 356 
Samples 198 211 214 214 218 203 1258 





The population density in 1941 was found to vary from 1 to 26 per square 
foot (average 9.7) in the variable areas and from 1 to 48 per square foot (aver- 
age 7.4) in the predominantly white areas. The area occupied by an average 
sample of too of the plants may thus be taken as about 12 square feet. It is 
stated, however, that in unfavorable years the species may be found only in 
sparse concentration or abundant only locally. Since the effective size of a 
population depends much more on the number of productive individuals in 
unfavorable than in favorable years, it is probable that the typical density is 
very much less than roo per 12 square feet. 

The average distance between samples at a typical station (four samples 
spaced at 250 foot intervals in a line) is 417 feet. Thus a station may be con- 
sidered as representative of a circle of about this radius and hence of an area 
of about 5.4105 square feet (41771) or 0.020 square mile. A station would 
contain about 45,000 sample areas if these were closely packed. However, 
since it is stated that about 60 percent of the ground is occupied by other 
vegetation, this estimate is to be reduced accordingly. Using round numbers, 
this indicates that there were about 2X10‘ sample areas in the population 
represented by a station in 1941. But if unfavorable years are taken into 
account, this number would probably have to be reduced enormously. 

The average distance between stations in a group of three at half mile inter- 
vals is two-thirds of a mile. A tertiary subdivision may thus be considered to 
be representative of an area of about 1.4 square miles (=(§)*x) and thus 
would contain about 70 station equivalents. No doubt there should be some 
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reduction to allow for interruptions. We shall use 50 as a round number for 
the station equivalents in a tertiary subdivision. 

The secondary subdivisions typically include 12 stations spaced at half mile 
intervals, often along a straight road, but more often involving roads at right 
angles to each other or two close parallel roads. If along a straight line, the 
average distance between included stations is 2§ miles. We shall consider a 
secondary subdivision to represent an area of about 14 square miles. 

The area occupied by the entire population is about 840 square miles. The 
average area of one of the six primary subdivisions is thus about 140 square 
miles. If each of these were completely filled by its five recorded secondary 
subdivisions, the average area represented by each of the latter would be 28 
square miles, just twice that estimated above. There were, however, large 
territories far from the roads which were not sampled. The smaller estimate 
accordingly seems preferable. These estimates are summarized below on the 
basis of the numbers in 1941 and on an arbitrary hypothesis. 


TABLE I 


The hierarchy of subdivisions. 











ESTIMATED NUMBER OF UNITS 
AREA 
PLANTS IN 1941 (A) 1941 (B) ARBITRARY 
Total population 840 = sq. mi. 6X 101° 6X 108 6X 108 
Primary subdivisions 140 sq. mi. 1010 108 108 
Secondary subdivisions 14 sq. mi. 109 107 10 
Tertiary subdivisions 1.4 sq. mi. 108 108 104 
Stations ©.02 sq. mi. 2X 108 2X10! 200 
Samples - sq. ft. (A) 100 I I 
1200 sq. ft. (B) 





It would appear that the total number of plants of Linanthus Parryae in 
this region was between 10! and 10" in 1941. The average effective size of 
breeding population over a period of years is probably much less. For the sake 
of comparison, two widely different hypotheses will be used for the area from 
which the parents of individuals are drawn: (A) the area occupied by 100 
plants in 1941 (12 square feet on the average), (B) an area 100 times as large 
to allow for years in which the population is sparse. Estimate (A) would be 
practically the minimum possible even if 1941 were a typical year. Estimate 
(B) is quite arbitrary. 


THE SIGNIFICANCE OF DIFFERENTIATION WITH STATIONS 


The first statistical question that requires consideration is whether or not 
there are greater differences among samples from the same station than are 
expected from the accidents of sampling. Let p be the actual but unknown 
frequency of blue in a homogeneous local population. Let po be the observed 
frequency in a random sample of N individuals. Let p=po—p. Then o4,* 
=p(1—p)/N. This may be estimated from the observed frequency by apply- 
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ing the usual Gaussian correction after substituting po for p. Thus for L 
samples from the same homogeneous population 


i : 
3 > po(t — po) = — _* (p + 5p)(1 — p — 5p) 
1 


a =p(ii-—p)—o% if L=o 
= (N = 1)o}p 
(2) Op» = Po(t — po)/(N — 1) 


taking po(1—po) as the best estimate of the theoretic mean, obtainable from 
a single sample. 

In a group of K samples of N each, not necessarily drawn from a homo- 
geneous population, and with observed variance op,2= >, 1(Po—po)?/K 


(3) os = = DX [po(t — po)/(N — 1)] 
(4) os = [po(t — po) — 03, |/(N — 1). 


The deviation of the frequencies, po, of samples about the unknown actual 
frequency, p, of the whole heterogeneous population may be analyzed into 
two independent components, the deviation from the observed mean frequency 
po of the K samples and the deviation of this from p 





(5) (po — Pp) = (Po — po) + (Po — Pp) 
2 I L K 
(6) F(po—p) = . Z, u (Po as Bo). /K+ Few 5)/K 
1 
K cote Oe I L 2 
(7) — F(pp—p) = Tpo- 
K (rE ) L u Pp 


If the observed variance o,,? be treated as representative of the theoretical 
average ) .0,,2/L, we obtain the formula with ordinary Gaussian correction for 
the uncertainty of the mean. This, however, will not do in this case, because 
op, is not independent of po. If, for example, the frequencies of blue in four 
samples of 100 plants are 100, 100, 0, 0, respectively, the observed variance, 
op,’ =}, is the theoretical maximum, and application of the Gaussian correction 
gives the impossible estimate o*(,,5)=4. Since o,,2 necessarily approaches o 
as Po approaches either o or 1, assume as a first approximation that o,,” 
= Cpo(1— po) where C is a constant for a given group of samples. Then 


K — a Cc L 
(8) qx er FT u Po( — po) 
CL 
(9) = Cp —- — Dp’ 
Li 
‘ 
(10) But © te-3) = p Be - > and also © te-7)/K. 
1 
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K = - = = it 2 
(11) Thus KB) = Cp(1 — p) - Ror 
(12) oipe-2) = KCP(x — p)/(K + C — 2). 


The best estimate of p(1—p) is po(1 —po). An approximation for the variance 
of sample frequencies, corrected for uncertainty of the mean is thus 


(13) @tp-z) = Koy,/(K + C — 1) where C = o54/po(1 — fin). 


It may be noted that if C is small there is an approach to the ordinary 
Gaussian correction, but if C=1 (as in the extreme case cited above) there is 
no Gaussian correction at all, and impossible estimates are avoided. 

This variance includes the sampling errors in the determination of the local 
frequencies as well as the variance due to real differentiation of local popula- 
tions. To obtain an estimate of the latter, the mean sampling variance, oi as 
given by (4) must be subtracted. 


= Ko,,” v Po(t — Po) — op,” 
(K + C — 1) (3 = 4) 





(14) op 


While this seems to be as good an estimate as it is practicable to obtain from 
a single group of samples, it has serious limitations if po is close to o or 1. 
There is only one type of distribution in a group of four samples of 100 plants 
each for which pp=.coz5—namely, samples with the frequencies 0, 0, 0, I. 
Formula (14) gives very nearly o,2=o0 (exactly if C is treated as 0), but ob- 
viously no information is given (or can be given) on differentiation among 
samples from stations for which p (as opposed to po) is .oo25. Again, in such 
a station as 103 with frequencies 0, 0, 0, 5 the estimate of o,” is the maximum 
possible from a group of four samples with mean po=.o125 but is much less 
than might occur among stations for which p=.o125. The estimate from po 
and o,,” is satisfactory for values of po less than .25 or greater than .75 (if four 
samples) only if the extreme type of distribution (0, 0, 0, n) is rare. In the 
present data, distributions of this extreme type are abundant below po=.05 
and above po=.95 (24 in 40 stations, excluding those in which all plants or 
all but one were alike). There were 61 stations in which there were from 5 to 
95 percent blues. Among these, only two showed the most extreme possible 
differentiation for their average (namely, stations 137 with 0, 0, 30, o blues 
and station 371 with o, 100, 100 blues). The following estimates are based on 
these 61 stations. 


NO. OF SAMPLES NO. OF ESTIMATED ESTIMATED 
PER STATION STATIONS o,* ep 
2 8 +0304 -OO12 
3 16 .0704 .0013 
4 37 .0269 .O0017 


Total 61 .0400 .OO15 
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The estimates of o,” varied enormously. In one case (station 292) the esti- 
mated variance was less (by an insignificant amount) than that expected from 
the accidents of sampling. At the opposite extreme were stations 371 (0, 100, 
100) and 27 (0, 99, 100), which were largely responsible for the high mean 
estimate of o,” in stations with three samples. On the average, the total esti- 
mated variance is about 28 times that expected from accidents of sampling. 
There is thus no doubt of the reality of the differentiation among samples 
from the same station. 


SIZE OF THE PARENTAL POPULATION ON FOUR HYPOTHESES 


Of greater interest for our present purpose is the variability of gene fre- 
quencies. Unfortunately this depends on the answers to a number of questions 
which could be obtained only by experiments which have not yet been made. 

It is conceivable that the difference between blue and white is not genetic 
at all, but this is highly improbable. Assuming that blue and white differ 
genetically, it makes a difference whether there is exclusive cross pollination, 
exclusive self fertilization, or some intermediate condition. While self pollina- 
tion appears improbable, we shall consider this possibility as well as that of 
cross pollination. Under exclusive self pollination, the mode of inheritance 
makes no difference in the distribution of the blue and white ciones. If, how- 
ever, there is cross pollination, the estimates of gene frequencies from observed 
phenotypic frequencies depend on the mode of inheritance. We shall consider 
three extreme hypotheses—namely, that blue is recessive, that it is dominant, 
and that it depends on multiple factors and a threshold. 

The primary purpose of the analysis will be to find the effective size of the 
population from which parents (of adjacent individuals in the case of ex- 
clusive self fertilization) must be drawn to account for the observed distribu- 
tion as a cumulative consequence of sampling, according to the theory of 
isolation by distance recently presented (WRIGHT 1943). The possibility that 
the distribution may be affected by differential selection must also be exam- 
ined. We shall consider first the differentiation demonstrated above to occur 
within stations and after this the differentiation among larger populations. 

If there is exclusive self fertilization, we are concerned with the quantity 
E=<o,"/p(1—p), which measures the correlation between adjacent plants rela- 
tive to the population of the station (WRIGHT 1943). In this formula, p is the 
mean, go,” the variance of the frequency of blue among the unit populations 
from which adjacent plants are drawn. However, since these populations are 
unknown, we can only calculate E from the samples of roo plants. This has 
been done separately for each of the 61 stations in which po was between .o5 
and .g5, using formula (14) to estimate o,” in each case. The values of E ranged 
from 0.00 to 1.00 in a very asymmetrical distribution. The distribution of \/E 
was more nearly normal. 

If the hypothesis of exclusive self fertilization is correct, the variation of 
VE should be independent of jio. This was tested by calculating the correlation 
and regression coefficients. The statistical constants are given in table 2. It 














LOCAL VARIABILITY IN LINANTHUS 145 


turns out that the regression coefficient of \/E on po, .22+.10, is slightly more 
than twice its standard error. This is not in good agreement with the hypothe- 
sis, although it cannot be considered to eliminate it. 

The effective size of the parental population may be estimated from the 
average value of E. In these 61 cases the average value was .210. (It may be 
noted that this is considerably larger than the square of the average value of 
VE given in table 2 because of the great variability among stations. In fact 


E=(VE)*+oy_=.155+.055=.210.) 





TABLE 2 


__ Statistical constants under four different genetic hypotheses. po,q and mi are the stations means, and 
po, q and ~ are the means of these means for the group of stations considered. In the first column, b is the 
regression of »/E on po and r is the correlation between these variables. Similarly b and r refer to the 
corresponding variables in the other columns. 














CROSS FERTILIZATION 














SINGLE GENE DIFFERENCE MULTIPLE ADDITIVE 
(1) SELF FERTILIZATION FACTORS AND THRESHOLD 
(2) BLUE (3) BLUE 
DOMINANT RECESSIVE 
RANGE (po) .05t0.95 RANGE(q) .10 to .go .10 to .go RANGE(m) —1.85 to +1.85 
NO. 61 NO. 47 63 NO. 57 
Po -440 q .380 «S87 m —.212 
o> -254 oq 202 -245 om -932 
VE +394 VF .380 -373 VF - 399 
ONE - 234 o VE .256 .229 C VF +232 
r +.24+.10 r +.56+.10 —.21+.12 r +.76+.13 
b +.22+.10 b +.7o+.16 —.20+.12 b + .040+ .033 
E .210 F .210 .192 F ats 
N (A) 45 N (A) 25 27 N (A) 25 


N (B) 25 N (B) 14 I5 N (B) 14 





If for the moment we assume that the samples correspond in size to the 
groups from which the parents of any individual are drawn (hypothesis (A)) 
an estimate can be made of the effective population number of these groups 
from the theory of area continuity (WRIGHT 1943, fig. 7). Under this theory it 
requires a parental population of about 45 to give a value of E of .210 ina 
total (station) including 2X 10‘ of these unit populations. This can agree with 
the actual number of plants in a sample (100) on taking account of the fact 
that these varied enormously in productivity. However, as noted, 1941 was 
a year of exceptional abundance. On the average it might require a consider- 
ably larger area than indicated in this year to provide an effective population 
number of 45. But in this case there would be less than 2X 10‘ such groups in 
the area represented by a “station,” which in turn would require a smaller 
estimate of the population number of the parental group. If, for example, a 
station includes only 200 parental groups (hypothesis (B)) instead of 20,000, 
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the effective population number of these under the theory would be about 25. 

Consider next estimates under the hypothesis of prevailing cross pollination 
with blue dependent on a single differential gene. If blue is dominant, its gene 
frequency in a sample is given by q= 1—+/1—po. A variation, 6p in phenotypic 
frequency, implies the variation 5q= ép/2\/1—p in gene frequency. Thus the 
sampling variance of gene frequencies is given by the formula 


— K Tsp- | 

os ol K ie — Po) 

7 >| Po(i — Po) |- Do 
1 La( 


N—1)(1—p)J 4(N—1)_ 


(15) 


| 





a 


The mean gene frequency, G, and the variance, o,”, were estimated for each 
station by the following formulae in which N was 100 and K usually 4. 


z K 
a tins 
(16) q 25 


Koy,” Po I K 
17)  o2 = —————_- - —— where o,,.2 = — 2—@G 
OW) a! een wel way dhag-ada 





and CC = o4,2/G(1 — 4). 


The quantity F=o,?/q(1—4) (slightly different from the provisiona] quan- 
tity C) should be independent of G on a valid hypothesis. This was tested by 
calculating the regression of \/F on @ for all cases in which G was between .10 
and .go (47 in number). There turns out to be a highly significant positive 
regression +.70+.16. This means that there is a.great deal too much varia- 
bility at stations in which blue is common as compared with that in stations 
in which it is rare to be compatible with this hypothesis. 

The average value of F on this hypothesis is .210 which would imply a 
parental population of about 25 if it is assumed that there are 2X10‘ or more 
of these in the area represented by a station (hypothesis (A)). However, if 
there are fewer per station, the estimate must be decreased. If, for example, 
there are only 200 parental populations per station area (hypothesis (B)) the 
estimate of effective size is about 14. 

If blue is assumed to be recessive, its gene frequency in a sample is given by 
q=~Po. Phenotypic variation dp implies 5q= 5p/2\/p and mean sampling 
variance o%q?=(1—po)/4(N—1). There were 63 stations for which q was 
between .1o and .go under this hypothesis. The values of F, \/F, and o,? in 
each of these were estimated by formulae analogous to those used under the 
hypothesis of dominance. 

It turned out that the regression of \/F on q is negative, —.20+.12. There 
is not enough variability within stations in which blue is common to satisfy 
this hypothesis in contrast with the situation if blue is assumed dominant. 
The regression is less than twice its standard error, however, so that the data 
cannot be considered to be incompatible with recessiveness of blue. 


LOCAL VARIABILITY IN LINANTHUS 147 


The average value of F on the hypothesis of recessiveness is .192 and thus 
only slightly different from that on the hypothesis of dominance. The estimates 
of effective size of parental group are accordingly practically the same (27 
under hypothesis (A), 15 under hypothesis (B)). 

The third hypothesis with respect to the mode of inheritance, assuming 
cross fertilization, is that the alternative characters biue and white depend on 
whether the cumulative effects of multiple factors (with no dominance or 
interaction) exceed a certain threshold. The common type of polydactyly in 
guinea pigs is an example of a character in which there is a superficial simula- 
tion of simple Mendelian heredity but in which data from F; and later genera- 
tions indicate the above mechanism (WRIGHT 1934). The methods used in the 
analysis of polydactyly may be applied except that the correction for the 
sampling error was given incorrectly. (Fortunately the correction was so small 
that this had no appreciable effect on the results.) 

Assume that the multiple factors determine a substantially normal distribu- 
tion on a primary scale but: that the phenotype depends on whether the value 
on this scale is above or below a threshold. It is convenient to take the thresh- 
old as the zero point and take as the unit of measurement the standard devia- 
tion (on this primary scale) characteristic of a sample in the station under 
consideration. The proportion above the threshold in a given sample with 
mean at m on the scale (and thus threshold at —m relative to the mean) is as 
follows in terms of probability integral,* 


I “1 ‘ 
pri x1 = — | e-* dx 
V 20 J _« 
(18) Pp 
(19) m = pri“! p. 


1 — pri (— m) = prim 


A phenotypic variation dp implies 5m= ép/y, where y is the ordinate of the 
unit normal curve at the threshold. Thus the sampling variance for m is 
Fsm? = Po(I—Po)/y2(N—1). This was found for each sample and averaged to 
find the correction to be applied in calculating on. 


K K 
Cm" = | 2m = nm >, m|/(K — 1) 
1 1 
(20) ; 
— © [pol — po)/y*]/K(N — 1). 
1 
To estimate the inbreeding coefficient F from om? it may be noted first that 


with multiple factors and no dominance or factor interaction, F measures the 
proportional decrease in the variance of characters within random breeding 


2 In a number of papers (WRIGHT 1926, 1934, etc.) prf x: has been used for the area of the 
probability curve between the mean and the deviation x;, as a multiple of the standard deviation. 
This has the disadvantage that the inverse function is two-valued unless areas below the mean 
are treated as negative. The form used above avoids this difficulty. The recognized form erf x: 
= (2//zx) [>'e*"dx is inconvenient for several reasons. 
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subgroups (WRIGHT 1921). The contribution of a pair of alleles A, a to the 
variance of a subgroup under the above conditions is 2q(1—q)(A)* where 
(A) is the effect of replacing a by A. The average contribution in an array of 
such subgroups is 2[4(1—G)—o,°](A)?. But o,2?=4(1—q)F, where F is the 
inbreeding coefficient of individuals relative to the total. Thus the average con- 
tribution of A, a to intragroup variance may be written 2(1— F)q(1—4)(A)?. 
The total intragroup variance is merely the sum of such contributions under 
the conditions. 


(21) og? = 2(1 — F) >) [q(x — G)(A)’). 


Next it may be noted that F measures the proportional increase in the 
variance of the fotal population resulting from subdivision into inbred lines. 
This total variance (o;,2) is compounded of the intragroup variance (o,”) given 
above and the intergroup variance (¢,,”). The contribution of A, a to the mean 
of each subgroup is of the form 2q(A). The variance of these is 40,?(A)? 
= 4Fq(1—q)(A)?. 

(22) Cm" = 4F >. [q(a a q)(A)?] 
(23) a? = og? + om? = 2(1 + F) >) [G(x — G)(A)?*]. 
(24) Thus F = om?/(20¢? + om?). 


In the present case, og2=1 by hypothesis. 
(25) F = Om?/(2 + Fin) 


Estimates of F have been made separately by this formula for each of 57 
stations in which m was between —1.85 and +1.85. Again \/F should be 
independent of mi if the hypothesis is valid. The regression of \/F on m turned 
out to be positive but not significant (+.040+.033). From this standpoint 
this hypothesis is satisfactory, more satisfactory in fact than any of the others 
considered. 

The mean value of F was .213, substantially the same as under the hypothe- 
sis that blue depends on a single dominant (F=.210) or a single recessive 
(F=.192). The estimate of the effective size of the parental group is accord- 
ingly approximately the same under all of these hypotheses: 25 to 27 if there 
are 2X10 or more such groups in the area represented by a station but less 
if the number of random breeding units in a station is less than 2X 104 (about 
14 or 15 if the number of groups is as small as 200). 

The effects of partial determination of the character by environmental fac- 
tors are obvious under the last hypothesis. If there are environmental effects 
on individuals, not related to their location (for example, change in color with 
aging), the genetic component of intragroup variance would be less than 1, 
but on? would not be affected. In this case, F is larger than estimated above, 
requiring N (size of random breeding unit) to be smaller to account for the 
observed differentiation within stations. In the case of Linanthus Parryae, the 
observations of EpLiInc and DoszHansky tend to rule this out. 

If, on the other hand, there are environmental influences (such as character 
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of soil) that make a difference between samples but rarely between individuals 
of the same sample, it is o,,? that must be reduced if it is to represent genetic 
differentiation and the estimate of F becomes smaller than calculated above. 
In this case, a larger value of N is implied. If intersample variance is almost 
entirely environmental, even though genetic segregation occurs within samples, 
F is almost zero, and it is implied that N is so large (for example, over 1000) 
that there is virtual panmixia throughout the station. There is little likelihood, 
however, that flower color is due to such environmental effects in Linanthus 
Parryae. 
ANALYSIS OF VARIABILITY WITHIN THE RANGE 

It is next of interest to analyze the variability in the region as a whole. The 
mean gene frequencies in the primary and secondary subdivisions were as 
follows, assuming blue to be recessive. 


TABLE 3 


Gene frequencies in the primary and secondary subdivisions assuming blue to be recessive. 














PRIMARY SECONDARY SUBDIVISION 
SUBDIVISION A B c D E — 
I -573 . 504 -717 .657 .302 -551 
II -339 .032 .007 .005 .008 .078 
Ill .009 .000 .000 .000 .000 .002 
IV .O10 -005 .000 .000 .068 -O17 
V -126 .004 .002 .000 .000 .027 
VI .000 .106 -411 .224 .O14 -I51 
Total -137 





The distribution on the map is shown in figure 1. This brings out the three 
separate centers of high frequency, one in I overlapping II, a second in VI 
and a third in the southern parts of IV and V, to which Epiinc and Dos- 
ZHANSKY called attention. 


=z 





Figure 1.—The geographical distribution of the frequency of the gene for blue flowers (if 
recessive) in the portion of the range of Linanthus Parryae investigated by EpLinc and Dos- 
ZHANSKY. The six primary subdivisions are indicated by Roman numerals. 


It will be convenient to symbolize the various levels in the hierarchy by sub- 
scripts: 1 for a primary subdivision, 2 for a secondary, 3 for a tertiary, 4 fora 
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station, and 5 for a sample. Thus q, represents the mean gene frequency in a 
station, K4, the number of recorded stations in a tertiary population, etc., 
o4.3;c)’ is used for the gross variance of q, (stations) within the next higher 
category (with the modified Gaussian correction for uncertainty of the mean, 
but without correction for the sampling variance of the next lower level); o4.3? 
is the final estimate including this last correction. Finally, o4..2(=04.3?+03.2" 
+o2,:°+01.?) is the variance of qs within the total population. 

The variance of samples within stations was obtained for all of the 356 





stations. 
356 
p [> = a>, qs| 
P 1 7.2523 
o°5-4(G) = rom = = .0076 
x " — 959-3 
> [Ks— 1+ Gi] F 
2 ; 2 
05-4 = .95795-4(G) = -0073. 


The summations in the bracket refer to the Ks (usually four) samples of a 
station. C,=.161 is the average value of C4(=042/G(1—G)) for the 63 sta- 
tions in which qy was between .10 and .go. The mean sampling variance of qs 
in these 63 stations was such that o5.4?/o5.4;a)? was .957. This ratio is carried 
over to all stations. 

The variance of stations within the 120 tertiary subdivisions was as follows. 


120 
D [DX a? — asd as] 
1 4.7112 


04.3(G) = = = .0176 





120 s ~~ 267.9 
Dd [Ks — 1 + Cs] 
1 
2 2 2 
04.3 = 04.3(G)— 35605.4(G)/1258 = .0176 — .0021 = .O154. 


The number of stations within tertiary subdivisions (Ky) was usually 3. 
There were 30 tertiary subdivisions in which qs was between .10 and .go. 
a $[% q? — gad *] 


C; = = .266. 
30 1 Ky4qs(1 — qs) 





Similarly the variance of tertiary subdivisions within the 30 secondary sub- 
divisions was as follows. 


y [>d = qe: qs| 





2 1 2.1386 
03.2(G) = = = = .0220 
- sae 97-1 
> [Ks -—1+ G] 
1 
2 2 2 
03.2 = 03.2(G) — 1 2004.3(G)/356 = .0220 — .0059 = .O16t. 


Here Ks was regularly 4. The average value of Ces, calculated from ten 
secondary subdivisions for which q2 was between .10 and .go, was .237. 
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The variance of secondary subdivisions within the six primary subdivisions 
was as follows. 


¥ [do a2? — qd al 








2 . 3218 
92:14G) = ; = s = .O129 
= .O 
> [Ke-1+C] 
1 
2 2 2 
02.1 = 92.1(G) — 93.2(G)/4 = .0129 — .0055 = .0074. 


Here Kz was regularly 5. There were three primary subdivisions with qu: 
considered large enough to warrant estimation of C,—namely, I, II, and VI 
with values of C; of .084, .237 and .182 and Ci =.168. 

Finally the variance of the six primary subdivisions within the total was as 
follows. 


6 6 
D a? — qed a 
1 1 











2 - 2107 
1-4(G) = /-14+c) = ‘ = .O414 
ee = esas - ants = .0414 — .0026 = .0388. 
Here > a? — aed 
C, = = .309. 


6qe(1 = qt) 


There is little doubt of the reality of differentiation at all of these levels 
except for the case of secondary within primary subdivisions in which 2.1)" 
is only 2.3 times the variance expected from accidents of sampling at the next 
lower level. The extreme departures from normality in the distribution of q, 
however, must be kept in mind. They make the applicability of FisHER’s 
(1938) z test somewhat dubious. 

The variance of the groups at each level within the total may be obtained 
by adding the variances down to the level in question. Thus o;.,2=.0388, 
02.42 = .0462, 03.42 = .0623, o4..°=.0777 and o5.,2=.0850. By dividing these ex- 
pressions by qr(1—qt) where q:=.137 we obtain values of o1.,°/qe(1—qz) etc. 
This expression has been shown to be equal to (F,—F\)/(1—F i) under area 
continuity, in the absence of disturbing factors (WRIGHT 1943). It is .717 for 
samples, .656 for stations, .525 for tertiary groups, .390 for secondary groups, 
and .327 for primary groups. In figure 2 the square roots of these figures are 
plotted against number of random breeding units and compared with the 
theoretical curves for N=10 and N=20 deduced in the paper referred to 
above. The number of random breeding units is as given in table 1 using the 
arbitrary assumption (B) that there are 200 of these to a station. 

From inspection of this figure it appears that there is considerably more 
variability of the higher categories than expected from that within stations. 
It would require a random breeding unit of considerably less than ten to 
account for this variability in contrast with about 15 indicated by the varia- 
bility within stations on the assumption made above. If the samples are con- 
sidered to be the random breeding units (2 X10‘ per station equivalent) the 
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discrepancy is greater, since the variability within stations indicates a random 
breeding unit of about 27 on this hypothesis, while the figures for the higher 
categories are only slightly modified. 
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FicurE 2.—The small circles connected by broken lines indicate the calculated variability of 
the frequency of the gene for blue (if recessive) among subgroups (samples, stations, tertiary, 
secondary and primary subdivisions of the range studied). Comparisons are made with the theo- 
retical variability on the hypotheses that the effective number of individuals in a random breeding 
unit is 10 or 20, and that long range dispersal or mutation is negligible. 

F1GURE 3.—The triangles indicate the calculated variability of the gene frequency of blue (if 
recessive) among subgroups of the primary subdivision II. The squares and circles do the same 
for primary subdivisions VI and I, respectively. These are compared with the theoretical amounts 
of variability on the hypothesis that N=10 and that there is long range dispersal or mutation 
at rates up to m=10~+. 


The most serious discrepancy is in the great variability of the primary sub- 
divisions. In this case, however, the comparison with theory is hardly a fair 
one. The elongated range along the piedmont would favor a greater amount 
of differentiation than indicated by the theory of area continuity. There is 
some approach to the conditions of linear continuity. 
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TABLE 4 

Analysis of variability in six primary subdivisions (I to VI). The numbers in column 1 designate 
the level in the hierarchy. Column 2 gives the number of groups at each level included in the primary 
subdivision. Column 3 gives the sum of squared deviations of q from the mean of the next higher cate- 
gory. This is divided by the entries in column 4, > (K—1+C) to give the entries in column 5, which 
are gross intragroup variances not corrected for the accidents of sampling. The sampling variances 
are given in column 6. Subtraction of these from the entries in column 5 give the net intragroup variances 
of column.7. The running sum of these (column 8) gives the variance of the groups at each level within 
the primary subdivisions. In column g these are divided by q,(1—@1) where q; is the mean gene fre- 
quency for the primary subdivision (table 3) to give the estimate of the quantity (F,;—F;)/(1—F)). 
The square roots of these quantities are the ordinates in figure 3. Column 10 gives the ratio of the gross 
variance (column 5) to that expe. :zd from sampling (column 6). 








(1) (2) (3) (4) (s) (6) (7) (8) (9) (10 
I 2 5 - 1037 4-2 +0249 .0172 .CO77 .0077 .031 1.4 
3 20 I.1140 16.2 .0688 .or61 .0527 .0604 -244 4-3 
4 57 1.9389 42.3 .0458 .0072 -0386 .0990 -400 6.3 
5 198 3-7814 150.2 .0252  .0009 .0243 .1232 .498 _— 
II 2 5 .0854 4-2 .0205 .0062 .O143 .O143 -199 3.3 
3 20 - 3990 16.2 .0246 .0069 -0178 .0321 -446 3-6 
4 59 -8993 44.3 .0203 .COI7 .0186 .0507 . 704 II.9 
5 211 -9825 161.5 .0o061 .0002 .0059 .0565 . 786 _— 
Iii 32 5 .OOOI 4.2 .0000 .0000 .00CO .0000 .c06 — 
3 20 .0003 16.2 .0000 = .0000 .0000 .00CO ° — 
4 60 .0037 45-3 .0COOI = .0001 .0000 .©000 ° — 
5 214 .0675 163.7 .0004 .0000 .0004 .0004 .213 — 
IV 2 5 .0034 4.2 .0008 .0006 .0002 .0CO2 .O14 1.3 
3 20 .0382 16.2 .0024 .0025 (—.000I1) .0oocor .CO4 °.9 
4 60 - 3440 45-3 .0076 .0007 .0069 .0070 -428 10.8 
5 214 - 4094 163.7 .0025 .OOOI .0024 .0094 -576° — 
V 2 5 .O125 4.2 .0030 .0005 .0025 .0025 .096 5.8 
3 20 0339 16.2 .002I .0025 (—.0004) .0021 -o81 0.8 
4 61 - 3484 46.3 .0075 .OOII .0065 .0085 331 7.0 
5 218 -6391 166.8 .0038  .Ooo: .0037 -O122 -473 _ 
VI 2 5 .1167 4.2 .0280 =. . 0085 .O195 .O195 .2g8 3:3 
3 20 -5533 16.2 .0342  .0090 .0252 .0446 348 3-8 
4 59 1.1768 44.3 .0266 .0026 .0240 .0686 -535 10.2 
5 203 1.3724 153-5 .0089 .0003 .0086 .0772 -603 _ 





ANALYSIS OF VARIABILITY WITHIN THE PRIMARY SUBDIVISIONS 


Because of the above consideration it is desirable to analyze the variability 
within the primary subdivisions. This has been done, with the results presented 
in table 4. Judging from the ratio of the gross variance to that expected from 
sampling (column 10), there appears to be significant differentiation at all 
levels within the primary subdivisions in which blue was not rare—namely, 
I(q=.551), I1(G=.078), and VI (G=.151) with one exception. The exception 
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is the absence of significant differentiation of secondary subdivisions in I. In 
addition, there was significant differentiation among secondary subdivisions 
in V and of stations in both IV and V in spite of the rarity of blues in these 
areas (G=.027 in V, G=.017 in IV). The data from III, which was almost uni- 
formly white flowered (G=.002), are wholly inadequate for any estimates. 
The significance of differentiation among samples within stations has not been 
determined separately for the primary subdivisions. As noted, the variance in 
stations in which G was between .10 and .go was about 24 times that expected 
from the accidents of sampling and undoubtedly significant. 

In figure 3 the values of ox,+/» ‘ar (1 —q) as estimates of V(Fi—Fi)/(a —F;) 
under the theory of area continuity are plotted against the estimates of the 
number of random breeding units included in the category in question, within 
primary subdivisions I, II and VI. Again it is assumed arbitrarily that there 
are 200 random breeding units in a station. 

The curves for the three primary subdivisions agree with each other as well 
as can be expected. They are somewhat more nearly parallel to the theoretical 
curve for N= 10 than when differentiation was considered relative to the entire 
range. There is still, however, more variability of the higher categories at least 
within II and VI than expected from the theory of area continuity. Again the 
discrepancy would be much more serious if the random breeding units are 
identified with the 1941 samples, 2X 104 to a station. 

There are other factors, however, that must be considered. Even if there are 
only 200 random breeding units per station, the number in one of the primary 
subdivisions is of the order 10° (and in the whole range considered here, 6 X 10°). 
This would be the typical number of generations to common ancestors of 
individuals that are far apart if the theory of area continuity applies strictly. 
In this case it would require something like a million years for a local colony 
to spread over this area, which is certainly highly improbable. However, means 
of dispersal to great distances so rare that only a minute fraction of the popu- 
lation of any occupied region has such an origin would enable the species to 
spread over a large suitable range in a few years. On the other hand, the effects 
of even a minute amount of replacement by a random sample of the species 
are not negligible. 

It was shown in the preceding paper that such replacement in the proportion 
m (whether due to long range dispersal or mutation or, as accurately as pos- 
sible, of uniform selection) removes nearly all random differentiation of popu- 
lations more than 1/m times the random breeding unit and considerably re- 
duces such variability in populations one tenth of this size. Thus reversible 
mutation between blue and white at rates of the order of 10~* per generation 
should practically eliminate random differentiation of primary subdivisions 
and somewhat reduce that of secondary subdivisions, even under hypothesis B 
(in which these are 10° and 10° times the random breeding unit respectively). 
Admixture of a random sample of the species into all populations at the rate 
10‘ per generation would practically eliminate all random variability of 
tertiary and larger subdivisions under the same hypothesis. Under hypothesis 
A this would eliminate random variability even among stations. The varia- 
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bility of samples within stations, however, would not be affected under hy- 
pothesis B and not very much under hypothesis A. The expected variability 
at each level in the hierarchy is shown in figure 3 for N=10 and m=10-4, 
10, 10° § and m<107~”, 

The large amount of differentiation actually found at all levels up to the 
highest indicates that some other factor than mere accumulation of sampling 
differences has been at work. 

One possibility is that there are irregularities in the distribution, including 
differences in density. This would cause a greater amount of random differ- 
entiation of the larger categories than expected under a uniform distribution. 

The most obvious possible factor that could counterbalance the effects of 
long range dispersal and mutation, however, is differential selection. Mr. W. 
HOovANITZ, in a personal communication, suggests that the climatic conditions 
may differ sufficiently near the ends of the region studied (I and VI, in which 
blue was relatively common) from those in the middle (where blue was rare) 
to make such an interpretation plausible. It is less plausible for the differences 
among secondary and tertiary subdivisions of the same primary subdivision. 

There is a possibility, however, of selective differentiation even in the ab- 
sence of any environmental differentiation. As noted in previous papers, the 
random differences in gene frequency, occurring in all series of alleles up to a 
certain level in the hierarchy, create a unique genetic system in each locality. 
Slightly different adaptive systems may be arrived at in different localities. 
If the gene or genes which distinguish blue and white play a role in any such 
systems, this would give a basis for locally different selection pressures. 

The distribution of blue and white can be accounted for most easily by sup- 
posing that most of the differentiation of the smaller categories is random in 
character and due to the accumulation of sampling accidents in random breed- 
ing groups of one or two dozen productive individuals per year but that at the 
higher levels, processes which tend to pull down random differentiation such 
as mutation and especially occasional long range dispersal are counterbalanced 
by selective differentials between local genetic systems. 


SUMMARY 


The detailed account of the distribution of blue and white flowers of the 
annual plant Linanthus Parryae in a region of the Mojave desert by EPLING 
and DoBzHANSKY provides interesting material for comparison with the theo- 
retical amount of random differentiation in a population that is continuous 
but in which dispersal is severely restricted. 

For this purpose the 840 square miles studied is broken up into a hierarchy 
of subdivisions. There proves to be highly significant differentiation of samples 
(of 100 plants) within stations (representative of about 0.02 square miles). 
There is significant differentiation of stations within tertiary subdivisions 
(about 1.4 square miles), of these within secondary subdivisions (about 14 
square miles), of these within primary subdivisions (about 140 square miles), 
and very marked differentiation of the primary subdivisions along the some- 
what narrow piedmont zone. 
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Assuming that the difference is a genetic one, four hypotheses are considered 
in connection with the variability of samples within stations. It is improbable 
that reproduction is by self fertilization, but if it is, it would require that the 
population from which adjacent individuals are derived be about 45 to ac- 
count for the observed variability, accepting the density of population found 
in 1941 as typical. This, however, was an unusually favorable year. Arbitrarily 
assuming 200 units per station instead of 20,000 as indicated in 1941, an-effec- 
tive population number of about 25 per unit is indicated. 

If there is predominant cross fertilization, it makes no appreciable difference 
whether blue depends on a single dominant or a single recessive gene or on 
multiple factors and a threshold. Under any of these hypotheses, the effective 
population number of the random breeding unit comes out 25 to 27 if the 
1941 estimate of numbers is accepted and 14 or 15 if the area occupied by a 
parental unit is assumed to be 100 times as large. 

The amount of differentiation of the higher categories is somewhat greater 
than expected as a random consequence of that of the lower categories, under 
continuity of area and with negligible rates of long range dispersal and muta- 
tion (rates less than 10~? per generation) and no differential or other selection. 
This is especially true of the primary subdivisions, but here the theory is un- 
satisfactory because of the elongated character of the range, which favors 
excessive differentiation. 

However, there is somewhat too much variability of the higher categories 
even within the compact primary subdivisions to be accounted for as wholly 
random under the assumption above. As it is highly probable that the theoreti- 
cal values should be substantially reduced because of long range dispersal] at 
rates greater than 10° per generation, there is probably a counterbalancing 
influence of differential selection. This would not necessarily depend on differ- 
ential environmental conditions. It could be a by product of the development 
of different genetic systems by the process of random differentiation. 
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N CONNECTION with a search for linkage in human genetics, the authors 

had occasion to test blood samples from 40 families with 138 children for 
the Rh factor. Since only a single previous paper has been published on the 
heredity of the Rh factor (LANDSTEINER and WIENER 1941), it seemed desir- 
able to report these results at the present time. 


MATERIAL AND METHODS 


The blood was obtained from the finger and collected in small test tubes, 
using as an anticoagulant one drop of 3.8 percent dihydric sodium citrate for 
every ten drops of blood. Two percent suspensions of the red cell sediments in 
saline solution were prepared, and the A, B groups and M, N types were deter- 
mined. The tests for these agglutinogens were carried out on Boerner well 
slides, one drop of each cell suspension being mixed in four wells with one drop 
of group A, group B, anti-M, and anti-N testing sera, respectively. The re- 
actions were read both macroscopically and microscopically after mixing on 
an electric shaking machine for ten minutes. Since only sera of high titer were 
used for the tests, all positive reactions were readily visible to the naked eye, 
even blood with weakly reacting agglutinogens, such as A2B. All blood speci- 
mens of group A and group AB were further tested with absorbed B serum 
in order to determine the subgroup. (For further details as to technique see 
WIENER 1943.) 

The tests for the Rh factor were carried out with a human anti-Rh/ serum? 
from a patient who had given birth to an infant with erythroblastosis fetalis. 
This serum had previously been found to give parallel reactions with standard 
guinea pig anti-Rh/ serum on a series of more than 100 blood samples. Since the 
serum came from a group A individual, it was necessary to treat it with group 
B saliva to neutralize the 6 isoagglutinins (WIENER and FORER 1941; WIENER 
1942). It was found that satisfactory results could be obtained with this par- 
ticular human serum by mixing one part of the serum with one part of boiled 
B saliva and two parts of saline. In the tests, one drop each of fresh, washed 
2 percent blood suspension and diluted serum were mixed in small test tubes. 
The mixtures were incubated in a water bath at 38°C for one hour, after which 
time the reactions were read grossly by inspecting the sediment. The tubes 
were then centrifuged at low speed for one minute, gently shaken, and the 
agglutinogen reactions read macroscopically and under the low power of the 
microscope. 


1 Aided by a grant from the Carnegie Corporation and the Committee on Human Heredity of 
the National Research Council. 

2 For this serum the authors are indebted to Dr. Howarp A. SPINDLER and Dr. Donan H. 
KariHe_r. At the time this study was carried out no guinea pig immune anti-R/ serum was at hand. 
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RESULTS 


The results of the A-B, M-N, and Rh tests in the 40 families are presented 
in table 1. 

The hereditary transmission of the RA factor in these families is summarized 
in table 2. The present findings are in accord with those of LANDSTEINER and 
WIENER (1941) and support their theory that the Rh factor is inherited as a 
simple Mendelian dominant. In the only family in which both parents are Rh 
negative, all three children are also RA negative. As has been shown by LANpD- 
STEINER and WIENER, in the mating Rk+ X Rh+, approximately 7.9 percent 
of the children should be Rh negative in a population where the incidence of 
the Rh negative individuals is 15.4 percent. The number of Rk negative chil- 
dren observed, nine among 106 children or 8.5 percent, does not differ sig- 
nificantly from the expected value. In the mating, RhA+ X Rh—, the expected 
incidence of Rh negative children was about 22.9 percent. The observed fre- 
quency was higher, but the difference is not statistically significant considering 
the small number of families examined. 


TABLE 2 


Summary of family material. 














NUMBER NUMBER OF CHILDREN 
MATING OF 
FAMILIES Rh+ Rh—- TOTALS 
Rh+ X RA+ 31 97 9 106 
Rh+ XRh— 8 17 12 29 
Rh— XRh— I ° 3 3 
Totals 40 114 24 138 





No contradictions to the laws of inheritance of the blood groups, subgroups, 
and M, N types were encountered in the present study. The data concerning 
the subgroups of A may be of some interest because of the limited number of 
investigations that have been carried out up to the present time. 

With regard to the linkage relations of the Rh factor, blood groups, and M, N 
types, our data offer further evidence for the independent assortment of the 
genes determining these characters. In table 3 are summarized the pertinent 
family data from the present study as well as the significant families from the 
report of LANDSTEINER and WIENER. While the amount of material is still too 
small to permit a precise analysis, the results appear to support the conclusion 
that the genes Rh and rh are located on a different pair of chromosomes from 
genes A, B, O, and M, N. Accordingly, there are now readily available markers 
for at least four (including sex) of the 24 pairs of chromosomes for linkage 
studies in human genetics. 

While there is little difficulty in obtaining high-titered anti-A, anti-B, 
anti-M, and anti-N sera, difficulties in procuring active human anti-RA sera 
have hitherto hindered studies on that factor. Suitable human anti-Rh sera 
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give striking reactions, but they are difficult to obtain because the vast ma- 
jority of sera from patients who have had hemolytic reactions to the Rh factor 
or who have had babies with erythroblastosis fetalis give surprisingly feeble 
reactions and at times even no discernible reaction (WIENER 1941; LEVINE 
et al. 1941). The advantage of immune animal sera such as can be produced 
for the M and WN factors is therefore obvious. LANDSTEINER and WIENER 
(1941, 1942) have described methods of producing satisfactory sera of this 
type by immunizing guinea pigs with rhesus blood. 

In heredity studies, the chance of detecting linkage with blood group factors 
A and B are increased somewhat if tests are also made for the subgroups, since 
the number of significant matings is thereby increased. In a similar way, the 
existence of subdivisions of the Rh positive type increases the value of this 
factor in linkage studies. Tests for the subdivisions of the Rh positive type 
were not carried out in the present study because suitable sera were not at 
hand when the family material became available. 


SUMMARY 


Tests for the Rh factor in 40 families with 138 children support the theory 
that the Rh factor is transmitted as a simple Mendelian dominant. The 
agglutinogen appears to be genetically independent of the four blood groups 
and the M, WN types. 

The authors are indebted to PROFESSOR F. BERNSTEIN for permission to use 
the blood samples of families collected in a study of linkage analyses for 
various hereditary factors done together with Mr. SIDNEY FINKEL and Mr. 
HERBERT L. Borison. 
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INTRODUCTION 


HE species Drosophila pseudoobscura is differentiated into several major 

and numerous microgeographic races. Inhabitants of large portions of the 
distribution area can be characterized by the presence or absence or by relative 
frequencies of certain types of Y chromosome (DOBZHANSKY 1935, 19378, 
1939a), of gene arrangements in the third (DoBZHANSKY and STURTEVANT 
1938, DoBZHANSKY 1939a, and unpublished data) and in the X chromosomes 
(STURTEVANT and DoBZzHANSKY 1936 and unpublished data), and of various 
modifying genes (summary in DoBZHANSKY 1937b). Microgeographic races 
differ in the incidence of gene arrangements in the third and X chromosomes 
(DoBzHANSKY and QUEAL 1938, KOLLER 1939, DOBZHANSKY 1939b) and of 
concealed recessive lethals (DoBZHANSKY and WRIGHT 1941, WRIGHT, Dop- 
ZHANSKY, and Hovanitz 1942). Differences between microgeographic races 
are smaller than but similar in kind to those between macrogeographic races; 
it is reasonable to suppose that the latter are compounded from the former. 
The data to be reported in the present article show that the genetic composi- 
tion of populations of some localities changes with time. The changes appear 
to be connected with the annual climatic cycle and are consequently reversible. 
Nevertheless, since these changes demonstrate a great plasticity of the species 
genotype, they may be regarded as models of adaptive evolutionary changes 
on the racial and higher levels. 

It is a pleasure to acknowledge the support which this investigation has 
received from the CARNEGIE INSTITUTION OF WASHINGTON, the help of Mr. 
W. Hovanitz and Miss A. M. Hozz in conducting the experiments, and the 
kindness of PRoFESSOR CARL EPLING and of Mr. ALEXANDER SOKOLOFF who 
have done a good share of the collecting of the flies in California on behalf of 
the writer. PROFESSOR J. T. PATTERSON has very generously contributed 
population samples collected in Texas. Messrs. E. HELD, HARLAN LEwIs, 
and Bruce WALLACE have also assisted the work in various ways. 


MATERIAL AND TECHNIQUE 


Population samples were taken at more or less regular intervals at nine 
collecting “stations” grouped in three “localities” on Mount San Jacinto, 
California. A map and a brief description of the San Jacinto region is given in 
Wricut, DospzHANsky, and Hovanitz 1942. It should be kept in mind that 
one of the localities, Andreas Canyon, lies at an elevation of about 800 feet 
above sea level, the other, Pinon Flats, at about 4000 feet, and the third, Keen 
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Camp, at about 4300 feet. The stations within a locality are from 200 yards 
to 23 miles apart, and the localities 10 to 13 miles apart. Because of the differ- 
ences in the elevation and other factors, the population density in Andreas is 
highest in March-April and lowest in summer. The population density at 
Pinon is greatest in spring and autumn, and at Keen in early summer. No 
flies can be collected at Pinon and Keen in winter, and at Andreas the flies 
are very rare in late summer. 

Five gene arrangements in the third chromosome are regularly present in 
populations of Mount San Jacinto. For descriptions of these arrangements, 
Standard, Arrowhead, Chiricahua, Tree Line, and Santa Cruz, see Dos- 
ZHANSKY and STURTEVANT 1938. A sixth arrangement, a new one, has been 
found in a single chromosome at Keen D; it had arisen by mutation probably 
shortly before its detection, and in view of its rarity need not concern us here. 
Two gene arrangements are found in the right limb of the X chromosome: the 
Standard and the “sex-ratio.” Males carrying the latter produce when crossed 
to any females progenies which consist almost entirely of daughters (StuRTE- 
VANT and DOBZHANSKY 1936). 

Whenever abundant material was available, the gene arrangements were 
recorded in approximately 100 third chromosomes from each sample. The wild 
females, most of which are impregnated before capture, are placed singly in 
culture bottles and allowed to produce offspring. Examination of the salivary 
glands in a single F; larva suffices to determine the gene arrangements in two 
third chromosomes (one of maternal and one of paternal origin). Wild males 
are crossed to laboratory females homozygous for the Standard arrangements 
in the third and in the X chromosomes. The salivary glands of seven F; larvae 
are examined, which gives the probability of 63 out of 64 chances that both 
third chromosomes of the father are known. Examination of eight F; larvae 
in the offspring of a wild female gives the information of the gene arrangement 
in four third chromosomes (two maternal and two paternal). This method may 
be misleading if a female is fertilized by more than a single male and if the 
sperms of the different males mix. Since such instances do occur, although 
rarely, this method has been used only seldom, when an adequate sample of 
the flies could not be procured. The presence or absence of “sex-ratio” in wild 
females has been determined by cytological examination of their offspring, and 
in wild males genetically by observing the distribution of the sexes in their 
progeny. No significant differences in the incidence of the gene arrangements 
have been observed between males and females from the same sample; the 
data for both sexes have, therefore, been combined. It may, however, be noted 
that owing to the greater efficiency of the examination of the chromosomes in 
in the offspring of wild females, a major portion of the data has been secured 
from females, and males have been used chiefly when samples have been small. 

In 1939 and 1940 samples were taken regularly at all nine collecting stations. 
In 1941 only four stations, and in 1942 only three stations, one in each Andreas, 
Pinon and Keen, were followed. The territory of a “station” has, of course, 
remained constant throughout, and the traps in which the flies were collected 
were exposed always on the same trees. 











164 TH. DOBZHANSKY 
THE DATA 


Table 1 summarizes the data on the gene arrangements in the third chromo- 
somes at all stations on Mount San Jacinto. The dates of collecting, the 
gametic frequencies of the gene arrangements (in percentages), and the total 
numbers of the chromosomes examined (n) are given. The data are broken up 
by localities and stations, and within each station by breeding seasons. 
A “breeding season” is the interval between two consecutive contractions of 
the population. Thus, at Andreas Canyon the breeding season extends from 
September to June, although the flies almost certainly continue to breed in 
midsummer when their population density is very low. At Keen Camp the 
overwintering population may be rather large, but the ostensible absence of 
the flies during the winter months makes a convenient break. At Pinon Flats 
the flies are common in spring and in autumn, but the summer depression is not 
so striking as it is at Andreas; the “breeding season” at this locality is taken 
to extend from early spring till late autumn. 

The gene arrangement has been determined in a total of 12,807 third chro- 
mosomes. As stated above, the data in table 1 are given in terms of the gametic 
frequencies of the gene arrangements in each population sample. Every fly, 
however, is a zygote which carries two third chromosomes, a maternal and a 
paternal one. These two chromosomes may have either similar or different 
gene arrangements, and our technique of collecting the data has permitted the 
discrimination of structural homozygotes and heterozygotes. A problem of 
considerable importance is whether the populations studied are panmictic with 
respect to the gene «rrangements or broken up into castes or clans which 
interbreed to only a limited extent. Let the gametic frequencies of the gene 
arrangements in a population be (a+b+c+ - - -)=1. In a panmictic popu- 
lation the zygotic frequencies will be in accordance with the terms of the 
expansion (a+b+c-+ - - - )?. Isogamy (preferential mating of individuals with 
like gene arrangements) will cause structural homozygotes to be more and 
structural heterozygotes to be less frequent than in a panmictic population; 
anisogamy will have the opposite effect. DoBzHANSky and QUEAL (1938) and 
KOLLER (1939) have examined the zygotic frequencies of the gene arrange- 
ments in the third chromosome in populations from the Death Valley region; 
no significant deviations from the expectancies based on the hypothesis of 
panmixy have been found. Our data furnish a considerably more extensive 
material for a test of the validity of that hypothesis than the data ofthe 
authors just referred to. The results obtained will be presented in another 
publication; these results are in complete agreement with the view that the 
populations are panmictic. 

Three gene arrangements, Standard, Arrowhead, and Chiricahua, are com- 
mon in all localities and stations. The Tree Line arrangement is less common 
than the three main ones, but it is still present in most samples. Santa Cruz 
is decidedly rare, but it has been recorded at all but two stations. The differ- 
ences between localities, stations, and samples are quantitative. Standard is 
commonest at Andreas (57.6 percent), less common at Pinon (41.1 percent), 
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TABLE I 


Observed frequencies of the gene arrangements in different samples 
ST—Standard ; AR—Arrowhead ; CH—Chiricahua; TL—Tree Line; SC—Santa Cruz. 































































































DATE ST AR CH Sc n DATE ST AR CH TL n 
Keen A, 1939 Keen C, 1939 
Apr 24 42.9 23.8 23.8 9.5 _ 2r May 20 28.8 28.8 39.4 3.0 66 
May 13 39-1 20.7 20.7. 1.6 _— 64 Jun 2 19-0 35-0 41.0 5.0 100 
Jun 20 31.8 25.8 37.9 4.5 <= 132 Jun 29 20.8 28.3 49.1 1.9 106 
Jul 25 Me BC. BSE §:3 —_ 132 Jul 25 o2.9 393.6 43.2 4.5% 88 
Aug 31 20.0 36.0 38.0 6.0 — 50 Sep 1 25.0 30.0 42.5 2.5 4° 
Oct 22 26.0 43-5 26.3 4.4 _ 46 
Total 22 31.3 43-5 7 400 
Total 31 30.8 33.0 —_ 445 |— 
Keen C, 1940 
Keen A, 1940 Apr 14 31.0 19.0 46.0 4.0 100 
Apr 13 38.3 20.8 37.5 1.7 13.7 120 May 11 32.7. 25.0 38.5 3.8 104 
May 18 32.0 17.0 48.0 2.0 1.0 100 Jun 7 23-5 11.8 61.8 2.9 34 
Jun 8 gr. @.9 37.9 39.8 — 106 Jun 29 34-0 24.5 38.3 3.2 94 
Jul 14 36.0 23.0 40.0 1.0 _ 100 Jul 29 wa @£4. 2.2 34 52 
Aug 12 34.8 26.7 83:3 3-7 _ 108 
Sep 14 qt.2 17.6 32.4 &.8 _ 34 Total 29. 22.7 43-7 -9 384 
Total 34- 23.2 38.7 °.5 568 Keen C 26. 27.0 43.6 784 
Keen A, 1941 . 
K D 
May 10 25.8 29.0 42.0 3.2 _ 124 at 
Jun 5 40.0 16.7 38.9 4.4 — 90 May 20 32. 20-4 35-3 ‘9 34 
Jun 25 37.0 22.0 38.0 3.0 — 100] Jun 3_ 19. 37-7 40.8 2.3 130 
Jul 27 36.0 28.0 32.0 4.0 — 50 Jul 14 29. 27-5 37-0 “5 138 
Sep 6 38.2 25.0 32.4 4.4 _ 68 Sep 1 26. 33-8 36.8 -9 68 
Total 34. 24.2 37-7 _ 432 Total 25. 32.4 38.1 370 
Keen A, 1942 Keen D, 1940 
Apr 2 45. 17.6 20.4 -8 — 102] Apr 14 2 30.0 42.0 7.0 100 
May 2 45. 14-7 33-3 2. = 102 May 182s. 27.2 43-9 .S 114 
Jun 12 28. 15.4 46.4 0 “re, 110 Jun 29 31 14.4 49.0 8 104 
Jul 17 26. 18.0 52.0 = = 100 | Jul 29 40 32.0 24.0 0 100 
Total 36. 16.4 40.3 — 414 | Total 29. 25.8 40.0 418 
Keen A 34. 23:7. 37-5 0.2 18509 Keen D, 1041 
a + 
Keen B, 1939 May 10 - 29.8 de = 
Apr 25 21.1 36.8 42.1 — 19 304 s 3° a8 - os 
May 1333-3. 20-2 33-3 4-2 — a4 | Julbo 7 3s. en 
May 20 3I1.r 25.6 40.0 3.3 — 90 % 
Jun 20 aa sa ang *:5 A 78 Total 34. 24.1 36.1 332 
Jun 29) «634-4 «23-4 39-3 3-5 — 64 E a 
Jul 26 32-3 21.9 37-5 8.3 — 06 Keen D 29. 27.5 38.2 1120 
Aug 31 37-8 30.5 29.3 2.4 —_ 82 
Keen E, 1939 
Total 31.6 26.3 38.4 8 _- 453 May 20 30.0 35.0 35.0 20 
Jun 3 26.9 34-3 35-2 108 
Keen B, 1940 Jul 14 25.9 33-0 30-3 112 
May 11 on:6 98.5 @.8 8.9 1.0 102 Aug 18 26.9 26.9 46.2 78 
Jun 8 30.7 22.7 45.§ 1.2 — 88 
Jun 29 «634.8 9416.3 43-5 5-4 — 92 | Total 26. 32.8 39-3 318 
Jul 29 37:0 26.0 32.0 5.0 _ 100 
Total 30. 22.8 42.4 0.3 382 * One chromosome with a new gene arrangement 
Keen B 3I. 24.7 40.2 o.1 835 
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TABLE 1—Continued 






































DATE ST AR CH TL SC n DATE ST AR CH TL SC n 
Keen E, 1940 Andreas B, 1939 
Apr 13 90.6 922.2 45.2 32.2 — 144 Apr 24 62.5 20.0 15.0 2.5 ys 40 
May 11 22.6 15. 56.6 5.7 — 106 May 13 53.8 26.9 15.4 1.3 2.6 78 
Jun 7 28.3 23.6 43-4 4-7 sia 106 June 4 43-1 26.4 27.8 1.4 72 
Jun 29 6 Wig mie 6.8 1.0 104 
Jul 14 34.0 16.0 48.0 2.0 3 Too Total 51.6 25.3 20.0 1.6 1.6 190 
Aug 12 42.9 22.9 28.6 5.7 _ 7° 
Total gt.6 1.7 46.6 4.0 ©.2 630 Andreas B, 1939-1940 
Sep 21 56.9 26.7 14.7 1.7 _ 116 
Keen E 30.0 23.8 42.8 3.3 0.1 948 Oct 238 60.2 26.9 1.1 1.9 _ 108 
Dec 9 e.5 a4 Gf 8.8 a 78 
Keen 31.0 25.1 39.8 3.9 0o.1 5546 Jan 13 60.4 28.1 9-4 2.1 = 96 
Feb 10 57-8 24.1 16.4 1.7 _ 116 
Andreas A, 1939 (Spring) Mar 2 53-2 29.0 15-3 2-4 124 
Mar 26 or.2 $9 = 2.9 ah 34 Mar 28 62.0 16.7 16.7 4.6 - 108 
Apr 24 73-6 13.2 6.6 1.9 4.7 106 Apr 20 43-8 22.3 33.0 0.9 = 112 
May 13 54-8 30.7 11-3 3.2 — 62 | May19 29.2 33-3 33-3 4:2 — 24 
Jun 4 36.8 2.1 36.8 — 5.3 38 
Total 56.0 25.1 16.8 2.2 — 882 
Total 65.4 17-9 138.7 2.3 2.9 240 


Andr. B $5.2 32§.3 %7.4 2.8 ©.§ 1072 








Andreas A, 1939-1940 























Sep 21 52.9 21.6 20.6 2.9 2.0 102 Andreas 57-6 24.0 15.3 2.6 0.4 3818 
Oct 28 $2.0 34.3 10.8 2.9 _ 102 
Dec 9 46.2 31.7 19.2 2.9 _ 104 Pinon A, 1939 
Jan 13 SI.9 31-7 14-4 13.9 — 104 | Apr 24 48.7 33.3 15.4 2.6 39 
Feb 10 58.8 31.6 G2 2g _- 114 May 13 30.8 35.8 27.§ 3.3 2.§ 120 
Mar 2 62.7 24.6 8.5 4.2 —_ 142 Jun 21 go.8 99.2 9 29.6 116 
Mar 28 63.6 17.3 15.5 3.6 _ Ito Aug 19 34-7 38-8 23.6 8.3 _ 72 
Apr 20 45-0 24.0 26.0 4.0 1.0 100 Sep 19 49.0 21.0 24.0 4.0 ° 100 
May 19 43-3 26.7 28.3 1.7 _— 60 Oct a1 56.0 24.6 13.4 5.2 134 
Total 53-9 27.0 15.6 2 0.3 938 Total “S66 Si 2 45 68 581 
Andreas A, 1940-1941 Pinon A, 1940 
Oct 19 68.0 19.0 6.0 7.0 _ 100 
Mar 3 49:0 16.0 29.0 6.0 _ 100 
Nov 16 46.1 42.3 9.7 3.9 _ 26 Mar 29 0:8 tad we 4.8 i 
Nov 23 8 BS OS GD _ 48 : : : 4 
ae as ey ‘és oes ‘ Apr 21 34.9 18.6 41.9 3.5 86 
3 hs ” $ May to I. I ° 2 
Feb 1 49-0 33.0 16.0 2.0 — . 100 y se ee * — 
Feb 19 66.3 18.6 11.6 3.5 — 86 - ee ee ae NE pi e 
Mar 8 71.1 14.9 14.0 _ — 114 oe 34-6 25.0 37-5 2-9 a vo 
Apr 19 70.0 17.0 12.0 1.0 — 100 Nov 3 37-5 32-5 26.3 3.8 ces So 
Tota! 63.5 22.4 111.6 2.5 — 606 | Total 37-0 25.5 34.5 3.4 0.5 654 
Andreas A, 1941-1942 Pinon A, 1941 
Sep 6 59.8 23.8 14.8 1.6 —_ 122 Mar 7 56.4 10.9 23.6 8.2 0.9 110 
Oct 4 50.0 26.9 15.4 7-7 _ 26 Apr 19 58.2 20.0 17.3 4.5 _ 110 
Nov 8 66.0 18.0 11.0 4.0 1.0 100 May 10 44.0 28.0 24.0 4.0 _ 100 
Dec 6 48.0 34.0 15.0 3.0 _ 100 Jun 15 37-0 26.% 34.8 2.2 — 92 
Jan 11 $9.6 27.4 9-6 3.4 _ 146 Jun 28 29.0 38.0 30.0 3.0 _ 100 
Feb 2 64.0 18.0 15.0 3.0 _ 100 Aug 5 51.9 21.3 25.9 0.9 _ 108 
Mar 14 72.7. 20.5 6.8 _ — 44 Sep 6 61.4 13.6 15.9 9.1 _ 44 
Apr 2 73.3 %3.§ 9.6 2.9 1.0 104 Oct 4 50.0 25.0 16.7 8.3 — 36 
May 2 57-8 15.5 20.7 §.2 0.9 116 Nov 8 40.9 29.5 20.5 9.1 _ 44 
Jun 12 33-7 26.9 38.5 — 1.0 104 Dec 6 48.2 19.6 26.8 5.4 _ 56 
Total 58.3 22.3 16.0 2.9 0.4 962 Total @.4¢ S35 GS 4F €38 800 





Andr. A 58.6 23.6 14.5 2.8 0.5 2746 
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TABLE 1—Continued 

















DATE ST AR cz Tt xX n DATE ST AR CH TL SC D 
Pinon A, 1942 Pinon B, 1940 
Apr 2 51.0 21.6 19.6 7.8 — 102 Mar 3 40.3 17.7 33-9 8.1 _— 62 
May 2 48.0 17.0 25.0 9.0 1.0 100 Mar 29 48.2 20.9 28.2 2.7 _— 110 
Jun 12 29.8 22.8 40.4 7.0 -- 114 Apr 2r ms we mse FE (48 9° 
Jul 17 41.9 21.8 30.6 5.6 -- 124 May 10 25.0 23.0 44.0 8.0 _ 100 
—- Jun 1 20.6 30.4 43-3 5.9 _ 102 
Total 42.7 20.9 20.3 6.8 0.2 440 
Total 3-4 @.3 Bt 46 OS 464 








Pinon A 42.5 25.0 27.3 4.6 0.5 2475 





Pinon B av-6 WO 9.3 $7 6 968 
Pinon B, 1939 —— 

















Apr 24 54.6 22.7 9.1 13.6 ane 22 Pinon ee. oe eo | 
May 13 a.6 2.8 S65 §.6 2.7 120 

Jun 21 21.t 28.9 42.3 5.3 2.6 38 Grand 

Aug 19 36.9 33-3 27-4 2-4 ee 84 Total 48.7 25.0 29.4 §-7 Gg raBe7 
Sep 19 $8-3 94:4 22.9 _— _ 90 

Oct 21 53-3 26.0 19.3 0.7 0.7 150 

Total 41.5 29.4 25.6 °2.8 0.8 504 | 





and least common at Keen (31.0 percent); the corresponding figures for 
Arrowhead are 24.0, 25.0 and 25.1 percent, and for Chiricahua 15.3, 29.4, and 
39.8 percent. The differences for Standard and for Chiricahua are obviously 
significant. The totals for stations within a locality and for breeding seasons 
within a station agree with each other more closely. But an inspection of the 
data for different samples within a station and a breeding season discloses that 
appreciable changes occur from month to month, especially at Andreas and 
Pinon and to a lesser extent at Keen. 

For purposes of presentation it is desirable to anticipate at this point the 
results of the more rigorous tests to be described below. The relative fre- 
quencies of Standard and Chiricahua at Andreas and at Pinon change from 
month to month; the changes follow the seasonal climatic cycle. In figures 1-3 
the percentage frequencies of the gene arrangements (ordinates) are plotted 
against the months of the year (abscissae). Every sample taken during the 
four years of observation is shown by a dot. The curves represent the monthly 
averages of the frequencies of the respective gene arrangements. At Andreas 
and Pinon the frequency of Standard decreases from March to June and in- 
creases thereafter. Chiricahua is most frequent in early summer and reaches a 
low level during the winter months. Arrowhead is more stable than the two 
other arrangements, but it seems to be more frequent in spring than in summer 
and in winter. The changes at Keen are not so striking as those at Andreas 
and Pinon, but Chiricahua is apparently more frequent in May and June than 
in August and September. The amplitude of the changes in the composition 
of the populations within a breeding season is greater than the amplitude of 
the differences between breeding seasons within a station, between stations 
within a locality, and almost equal to the differences between the localities. 
The principal variable in the data is a cyclic change within a breeding season. 
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FiGuRE 1.—Frequencies (in percent) of the three most common gene arrangements in the 
third chromosome which occur in the population of Andreas Canyon. Dots—individual observa- 
tions; curve—monthly totals. This and the following two figures (Figs. 2 and 3) are based on 
observations which extended during four breeding seasons; the data for these four seasons are 
combined according to the month of collecting. 


CHANGES WITHIN A BREEDING SEASON 


The population of every station has been followed from two to four breeding 
seasons. In all, 25 sets of samples for different stations and seasons are shown 
in table 1. The degree of heterogeneity within a station and a breeding season 
can be estimated by means of a x? analysis. On the assumption that the ob- 
served heterogeneity is due to sampling errors alone, the expected numbers of 
chromosomes with each gene arrangement in every sample may be deduced 
from the seasonal totals entered in table 1. The numbers of Tree Line and 
Santa Cruz chromosomes, however, are too small in most samples for such an 
analysis; they are added to the Arrowhead chromosomes. The chromosomes 
are therefore divided into three groups: those with Standard, with Chiricahua, 
and with “Others” gene arrangements. The results of the computations are 
presented in table 2. 
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TABLE 2 
Variability observed within a station and a breeding season. 
































STATION AND df p STATION AND x df p 
SEASON SEASON 
Andreas A, Spring Keen A, 1939 II.29 10 0©.50-0.30 
1939 44.52 6 <o.o1 Keen A, 1940 9-38 10 0.50-0.30 
Andreas A, 1939-1940 44.37 16 <o.o1 Keen A, 1941 8.54 8 0.50-0.30 
Andreas A, 1940-1941 30.59 14 <o.o1 Keen A, 1942 18.59 6 <o.o1 
Andreas A, 1941-1942 77.82 18 <o.o1 Keen B, 1939 7.87. 10 0.70-0.50 
Andreas B, Spring Keen B, 1940 10.48 6 0.20-0.10 
1939 6.22 4 0.20-0.10 Keen C, 1939 4-74 8 0.80-0.70 
Andreas B, 1939-1940 41.34 16 <o.o1 Keen C, 1940 11.97. 8 0.20-0.10 
Pinon A, 1939 28.71 10 <o.o1 Keen D, 1939 4-54 6 0.70-0.50 
Pinon A, 1940 16.62 12 0©.20-0.10 Keen D, 1940 22.73 6 <o.o1 
Pinon A, 1941 40.77. 18 <o.o1 Keen D, 1941 9.67. 4 0.05-0.02 
Pinon A, 1942 16.53 6 0.02-0.01 Keen E, 1939 3-29 6 0.80-0.70 
Pinon B, 1939 39-77 10 <o.o1 Keen E, 1940 18.18 10 0.10-0.05 
Pinon B, 1940 28.92 8 <o.o1 
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FicurE 2.—The population of Pinon Flats. For further explanation see Fig. 1. 
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The data for Andreas Canyon fail to show a significant intra-seasonal change 
only for the station Andreas B in the spring of 1939. This is not surprising, 
since only three samples were taken at this station from April 24 to June 4, 
1939. Furthermore, the figures in table 1 show that Standard appeared to be 
more frequent in April than in June, which is the same direction of change as 
observed at the adjacent station, Andreas A, where the change is statistically 
significant. At Pinon Flats intra-seasonal changes are established for every 
station and season except for Pinon A in 1940 (table 2). But even in this case 
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Ficure 3.—The population of Keen Camp. For further explanation see Fig. 2. 


the frequency of Standard dropped from 49 percent in early March to 29 per- 
cent on June 1, and rose to 37 percent on November 2; Chiricahua rose from 
29 percent on March 2 to 42 percent on April 21, and dropped to 26 percenton 
November 2. This is the same type of change which is established at Pinon A 
and Pinon B in other years (table 1). At Keen Camp significant intra-seasonal 
changes are ostensibly rare (table 2). The population of Keen A has changed 
during 1942 but apparently not during 1939, 1940, or 1941. At Keen B and C 
no significant changes have been recorded. The population of Keen D under- 
went changes during 1940 and 1941 but not during 1939. At Keen E there was 
no change during 1939 but during 1940 there may have been some instability. 

The x? analysis estimates the probability that the heterogeneity of the 
samples taken within a breeding season is or is not due to sampling errors. 
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This analysis does not discriminate between haphazard and directed changes. 
Examination of table 1 shows that changes, at least at Andreas and at Pinon, 
are not haphazard. There is a clear tendency for the successive samples to be 
more nearly similar than the samples taken several months apart, except where 
the annual cycle is closed. The changes run parallel at different stations within 
a locality and in different years. The lowest frequencies of Standard at Andreas 
and Pinon have been observed eight times in June, thrice in May, and only 
once in November. The highest annual frequencies of Standard fell four times 
in March, four times in April, and once in September and October. The highest 
frequencies of Chiricahua are reached mostly in May and in June, but the 
lowest frequencies may fall in any month from September to April. The fre- 
quencies of Arrowhead fluctuate apparently irregularly. The directed nature 
of the intra-seasona] changes is attested by figures 1 and 2. If the changes in 
successive breeding seasons were unrelated, they would tend to cancel each 
other and the joint monthly averages would show no trends; actually, the 
curves connecting the monthly averages in figures 1 and 2 do show definite 
trends. 

The situation at Keen Camp may or may not be different from that obtain- 
ing at Andreas and Pinon. Since significant intra-seasonal changes have been 
observed in only three out of 13 sets of samples (table 2), clear trends are 
hardly to be expected. It should be noted, however, that since the collecting 
season at Keen is shorter than at Andreas and Pinon, the detection of the 
changes at Keen is correspondingly more difficult. 


DIFFERENT BREEDING SEASONS AT THE SAME STATION 


The problem next to be considered is whether or not the population of a 
station may change from one breeding season to another. Since significant 
alterations in the composition of the populations within a breeding season 
occur at least at Andreas and at Pinon, we must be on guard not to mistake 
spurious inter-seasonal changes for real ones. The samples have not.been taken 
at quite regular intervals, and some samples are greater than others. If, then, 
during one breeding season more or larger samples were taken—for example, 
in spring than in autumn—a spurious difference between seasonal totals may 
result. For this reason we disregard the samples taken at Andreas A and B 
in the spring of 1939 and-the samples taken at Pinon B in 1940 and at Pinon A 
in 1942. Hence, no inter-seasonal comparisons at all can be made for Andreas 
B and Pinon B, but data for three complete breeding seasons are available for 
Andreas A and for Pinon A. The x’’s are calculated as for the intra-seasonal 
changes, except that four classes of chromosomes are distinguished: those with 
Standard, Chiricahua, Arrowhead, and “Others” gene arrangements (“Others” 
are Tree Line and Santa Cruz combined). The results are shown in table 3. 

The populations which inhabited Andreas A in 1939-1940, 1940-1941, and 
1941-1942 appear to be significantly different (table 3). The meaning of this 
difference is nevertheless open to question, since no samples were taken at this 
station in May and in June of 1941. In other years the frequency of Standard 
reached a minimum and that of Chiricahua reached a maximum during these 
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months (see above). The seasonal total for 1940-1941 shows a lower frequency 
of Chiricahua and a higher frequency of Standard than do the seasonal totals 
for other breeding seasons at the same station. The seasonal totals for 1939- 
1940 and 1941-1942 do not differ significantly (table 3). The aberrant results 
for the season 1940-1941 are most probably accounted for by the incomplete- 
ness of the data for the critical months of May and June. No certain inter- 
seasonal changes have been observed at Andreas A. 


TABLE 3 


Different breeding seasons at the same station. 











STATION AND STATION AND 





e df  g x? df P 
SEASONS SEASONS 
Andreas A, 1939-1940, Keen B, 1939, 1940 1.76 3 0.50-0.30 
1940-1941, I194I- Keen C, 1939, 1940 10.12 3 0.02-0.01 
1942 18.37 6 <o.o1 Keen D, 1939, 1940, 
Andreas A, 1939-1940, 1941 11.18 6 0.10-0.05 
1941-1942 5-91 3 0.20-0.10 Keen E, 1939, 1940 19.42 3 <o.o1 
Pinon A, 1939, 1940, 
1941 36.61 6 <o.o1 
Keen A, 1939, 1940, 
1941, 1942 33-64 9 <o.o1 





The inter-seasonal differences observed at Pinon A are very significant, and 
they cannot be discounted on any such grounds as the changes at Andreas A. 
To be sure, more samples were taken during the autumn of 1941 than during 
the corresponding months in 1939 and 1940, and this may be a partial explana- 
tion of the high apparent frequency of Standard and the low frequency of 
Chiricahua in the total for 1941 (table 1). But the frequency of Chiricahua 
was appreciably and consistently higher during the summer months of 1940 
than during the summers of 1939 and 1941. Similarly, Standard chromosomes 
were rather more frequent in 1941 than in 1939 and 1940. The frequency of 
Chiricahua at Pinon B in 1940 was higher than in the corresponding months 
in 1939. The population of Pinon seems to change its composition from year 
to year as well as from month to month. Whether or not the intra-seasonal 
and the inter-seasonal changes are independent is quite another matter. The 
possibility must be considered that the climatic variations from year to year 
are reflected in the composition of the populations in the same manner as the 
succession of the seasonal climates. The winter of 1940-1941 was richer in pre- 
cipitation than the preceding and the following winters; the fly populations 
contracted much Jess in the summer of 1941 than they did in 1940 or 1942; the 
conditions in the summer of 1939 were about intermediate. In the midsummer 
of 1941 the flies were, in fact, rather abundant at Pinon Flats, while in other 
years they were abundant only in spring and in autumn. The contention that 
the composition of the population in 1941 differed from that in other years 
because of this vagary of the climate cannot be proved by the available data, 
but it seems not improbable. 
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The data for Keen Camp (table 3) are rather irregular. No inter-seasonal 
changes have been detected at Keen B and D, but at Keen A, C, and E some 
changes are indicated. In addition, the changes at Keen A appear to be di- 
rected: Arrowhead has decreased in frequency from 31 percent in 1939 to 23-24 
percent in 1940 and 1941 and to 16 percent in 1942. The figures for Keen sta- 
tions other than A show the same trends toward reduction of the frequencies 
of Arrowhead, although the statistical significance of these trends is doubtful. 
It looks as though the population of the whole Keen Camp locality has been 
losing Arrowhead chromosomes during the whole period of observation. The 
relative rarity of intra-seasonal changes at Keen (see above) makes this con- 
clusion somewhat stronger than it would have been otherwise. 


DIFFERENCES BETWEEN STATIONS WITHIN A LOCALITY 


Two sources of error must be guarded against in making comparisons of 
stations within a locality: the intra-seasonal and the inter-seasonal changes 
within a station. The stations Andreas A and Andreas B are only about 200 
yards from each other. Comparison of all the data for these stations gives 
x’=9.10, which, for 3 degrees of freedom, has the probability of chance oc- 
currence between 0.05 and 0.02, which is usually considered to indicate a sig- 
nificant difference. The observations at Andreas B, however, lasted only from 
April 1939 to May 1940, while those at Andreas A extended from March 1939 
to June 1942 (table 1). Comparison of the stations only during the period when 
both were under observation gives x?= 6.27, df=3, P=o.10; the difference is 
not significant. Comparison of the total data for Pinon A and Pinon B (dis- 
tance about 800 yards) gives x?=9.17, df=3, P between 0.05 and 0.02. Again, 
the duration of the observation period at Pinon B was less than at Pinon A. 
If identical time intervals are compared, x?= 2.02, df=3, P between 0.70 and 
0.50. The difference is not significant. 

The distances between stations at the Keen Camp locality are relatively 
large: Keen C and Keen E are about two and a half miles apart; the environ- 
ments at the Keen stations are rather diversified. In 1939 samples were col- 
lected at all stations. Comparison of the yearly totals gives x?= 24.72, df= 12, 
P between 0.02 and 0.01. Chiricahua was more frequent and Standard less 
frequent at Keen C than at Keen A or B. Unfortunately, the collecting at 
Keen A and B was started in April and at the other Keen stations late in May, 
which makes the validity of the observed heterogeneity between the stations 
doubtful. In summer of 1940 the five collecting stations at Keen were again 
followed. Comparison of the year’s totals gives x?= 12.43, df=12, P between 
0.50 and 0.30; the heterogeneity is not significant. In 1941 collections were 
made only at Keen A and D, with no significant differences detected: x?=0.59, 
df= 3, P between 0.90 and 0.80. 

The available evidence indicates that populations of stations within a locality 
do not differ in the relative frequencies of the third chromosome gene arrange- 
ments. WRIGHT, DoBzHANSKY, and Hovanitz (1942), however, have shown 
that third chromosome lethals found within a station are alleles more fre- 
quently than lethals found at different stations within a locality. These results 
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are not necessarily contradictory: the differentiation of the local populations 
need not go parallel with respect to different classes of genetic variants. The 
recessive lethals are not subject to strong selection pressures while in heterozy- 
gous conditions; the gene arrangements appear to respond to the environ- 
mental changes induced by the annual climatic cycle. 
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FicurE 4.—The frequencies of Standard (circles) and of Chiricahua (triangles) gene ar 
rangements in the third chromosome in the populations of Pinon Flats and of Andreas Canyon. 
Abscissae—time, ordinates—percents. 


SEASONAL CHANGES 
Various hypotheses have been tried out in attempts to discover which agent 
or agents are responsible for the production of the fairly striking intra-seasonal 


changes described above. It is conceivable that the flies spend their lives in so 
small a territory that several elementary breeding units occur within the area 
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of a collecting station. If so, different samples might represent now one and 
now another part of the area, and the differences between the samples might 
reflect this lack of uniformity. This hypothesis is excluded by the unpublished 
experiments which prove that the average radius of the daily activities of the 
flies is of the same order of magnitude as the diameter of a typical collecting 
station. Furthermore, this hypothesis fails to account for the observed paral- 
lelism of the variations at different stations within a locality and for the cor- 
relation between the compositions of successive samples. 

In contrast to the first hypothesis, it can be imagined that the flies range 
over territories of some tens of square miles in area and that they live in dis- 
crete droves or clans which maintain an internal cohesion in their wanderings. 
Fantastic as it is, this hypothesis is difficult to exclude entirely. It fails to ex- 
plain the observed cyclic nature of the changes, and it is contradicted by the 
results of unpublished experiments alluded to above which show that in an 
approximately uniform two-dimensional environment individual flies move 
more or less at random. 

A third hypothesis, which for a time appeared plausible enough (Dos- 
ZHANSKY 1940, 1941), is as follows: If the sources of food on which the species 
subsists in nature are abundant at only a few scattered points in the inhabited 
territory, it is possible that only a minute fraction of the individuals which 
compose the population of a station will locate the food and produce offspring. 
The fecundity of Drosophila pseudoobscura, under favorable conditions, is 
measured in thousands of eggs. The population of a territory as large as a 
station might, therefore, be descended from a small number of “lucky” in- 
dividuals. If so, the incidence of the gene arrangements in the population 
might fluctuate from generation to generation owing to the accidents of 
sampling involved in the choice of the parents. This hypothesis had tobe 
rejected for the following reasons. (A) Examination of the contents of the 
crops of flies caught in nature has shown that most individuals are able to find 
food every day even during the arid midsummer season in the mountains of 
southern California (unpublished data). (B) The mobility of the flies varies, 
depending upon temperature, from ten to about one hundred and fifty meters 
per day. The absolyte population density is mostly high enough so that a food 
source in an area of about one hundred meters in diameter will attract many 
flies. However, when the population density diminishes during unfavorable 
seasons the accidents of sampling may become important (unpublished data). 
(C) The observed changes in the incidence of gene arrangements run parallel 
at different stations within a locality. Except when the population reaches a 
minimum, the number of flies within a locality is much greater than could be 
descended from only a few parents. (D) The observed changes follow a seasonal 
cycle. 

The available data seem to fit best a fourth hypothesis, which assumes that 
the carriers of different gene arrangements in the third chromosome have 
different ecological optima. At Andreas Canyon and at Pinon Flats the Stand- 
ard gene arrangement is favored in spring and the Chiricahua arrangement in 
early summer. Natural selection alters the composition of the populations 
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accordingly. Since the gene arrangement which is favored at one season is dis- 
criminated against at other seasons, the net result over the whole year is 
typically no change at all—comparison of the totals for successive years shows 
usually no significant differences. The climatic variations from year to year, 
however, may disturb the regularity of the seasonal cycles and may produce 
the occasionally observed relatively slight differences in the composition of the 
population on successive breeding seasons. The fact of cardinal importance on 
which this hypothesis is primarily based is the cyclic nature of the observed 
changes in the composition of the populations. 

It has already been pointed out (p. 171, fig. 1-4) that, at least at Andreas 
and Pinon, the frequency of Chiricahua has reached a maximum in May, June 
and July on every one of the four years during which the observations have 
been made. Standard reaches its minimum while Chiricahua is at its maximum, 
and vice versa. It is important to examine this contention very carefully. Table 
4 presents the data for the entire period of observations broken up by months 
of collecting and by locality.'The seasonal changes at Andreas are very sig- 
nificant; if the chromosomes are divided into those with Standard, Arrowhead, 
Chiricahua, and “Others” gene arrangements, the x? turns out to be 128.23, 
df=27, P much less than o.o1. The changes at Pinon resemble those at An- 
dreas; they are highly significant: x*= 253.36, df= 21, P negligible. 

It is important to ascertain whether cyclic changes occur also at Keen Camp. 
A statistical test is negative: x?= 21.74, df=15, P between 0.20 and o.10. The 
greatest contribution toward this x? is made by Arrowhead chromosomes 
which contribute relatively little in the tests for Andreas and Pinon. Taken 
at their face value, the data suggest that Chiricahua is more frequent in May 
and June than it is in late summer and in autumn, and that Arrowhead is more 
frequent in August and September than it is during the earlier months (fig. 3, 
table 4). Significant changes within a breeding season have been observed at 
Keen A in 1942 and at Keen D in 1940 and 1941 (see above). At Keen A, the 
frequency of Standard decreased and that of Chiricahua increased from April 
till July of 1942; the seasonal change at Keen D in 1941, however, was in the 
opposite direction (see table 1). In some years samples at Keen stations have 
been taken over periods of five to six months (Keen A in 1939 and 1941, Keen 
B in 1939, Keen E in 1940), and yet no significant changes have been observed. 
The conclusion seems inescapable that seasonal changes at Keen, if any, are 
less pronounced and less regular than those at Andreas and Pinon during com- 
parable time periods. Nothing is known, of course, about the changes that 
might occur at Keen during the period from November to March when no 
population samples have been collected. 

The possible mechanisms through which a gene arrangement may influence 
the adaptive value of its carrier will be discussed below. 


DIFFERENCES BETWEEN THE POPULATIONS OF LOCALITIES 


Comparison of the totals for localities (tables 1 and 4) suggests that the 
Standard gene arrangement is more frequent at Andreas (57.65 + 0.80 percent) 
than at Pinon (41.13+0.84), and more frequent at Pinon than at Keen (30.98 
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+0.62). Chiricahua is more frequent at Keen (39.83+0.66) than at Pinon 
(28.14+0.77) and at Andreas (15.30+0.58). The frequencies of Arrowhead are 
uniform (24-26 percent). Tree Line is slightly commoner at Pinon (4.39+0.11) 
than at Andreas (2.62+0.08), Keen being intermediate. Santa Cruz is rare 
throughout. The data for separate breeding seasons agree very well with those 
for the entire period of observation. Localities ten to thirteen miles apart may 
therefore be inhabited by populations which differ in the incidence of the gene 
arrangements. This conclusion, although it is supported by other data (Dos- 
ZHANSKY and QUEAL 1938, KOLLER 1939, DOBZHANSKY 1939b, also unpub- 
lished information) as well as by those reported in the present article, is im- 
portant enough to deserve a most careful scrutiny. 

The collecting seasons at Keen, Pinon, and Andreas fall at different periods 
of the year. The frequencies of at least Standard and Chiricahua arrangements 
are known to undergo seasonal changes at Andreas and at Pinon. May it be 
that the populations of the three localities are really identical but they are out 
of step in their seasonal cycles? The data in tables 1 and 4 must be re-examined 
from this point of view. Let us compare the status of the populations of the 
three localities in the months of April, May, and June only; we find that the 
differences in the incidence of the gene arrangements are almost as striking as 
they are in the yearly totals (x?= 256.28, df=6, P negligible). The same is true 
if the months of September and October alone are considered (x?= 94.73, 
df=6, P negligible). The situation, however, is different if the data for May 
and June at Andreas, June and July at Pinon, and July and August at Keen 
are compared. A very significant difference is still apparent (x?= 47.71, df=6, 
P less than 0.01), but this difference is due principally to Andreas being differ- 
ent from Pinon and Keen, the latter two being very nearly similar. If the data 
for June alone are taken for Andreas, and compared with June and July at 
Pinon and July and August at Keen, no significant differences are any longer 
manifest (x?= 5.99, df= 6, P between 0.50 and 0.30). Such a comparison, how- 
ever, is not a fair one, because only 214 chromosomes have been obtained in 
June at the Andreas locality. To sum up: the populations of the three localities 
are different, but the amplitude of the seasonal variations at Andreas and at 
Pinon is so great that the inter-locality differences are at times almost or 
entirely submerged. 


POSSIBLE SELECTIVE AGENTS 


The assumption that the gene arrangement in the third chromosome affects 
the adaptive value of its carrier is a novel one and is contrary to the surmise 
which the writer had entertained in the past. Yet, Chiricahua chromosomes 
are favored at Andreas and at Pinon in late spring and in early summer, while 
Standard chromosomes are favored at other seasons and especially in early 
spring. The prevailing temperatures may be the selective agents, although both 
at Andreas and at Pinon the frequencies of Chiricahua drop and those of 
Standard rise again by the time the populations emerge from their summer 
eclipse stage. July and August are the hottest and driest months in both 
localities, yet in September the frequencies of Standard are higher and those 
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of Chiricahua are lower than they were in June. A more plausible view is that 
Standard is favored while the populations are at their highest density levels 
in the year, and that Chiricahua is favored while the populations are dwindling 
toward their summer eclipse stage, presumably because of a relative food 
scarcity. Experiments which may possibly shed some light on this problem 
are being planned. 

It is very improbable that the gene arrangements in the third chromosome 
of Drosophila pseudoobscura exert their effects on the physiology of their 
carriers because of the differences in the alignment of the genes (position 
effect). Such a view is contradicted by the absence, or at least the relative 
weakness, of the seasonal changes in the frequencies of gene arrangements at 
Keen Camp in contrast to their great amplitude at Andreas and Pinon. 
Furthermore, the summer environment at Keen resembles the spring environ- 
ments at Andreas and at Pinon to a greater extent than it does the summer 
environments in the latter two localities. The position effect hypothesis would 
lead one to expect that the frequency of Standard should be higher and the 
frequency of Chiricahua lower at Keen than at Pinon and Andreas. Exactly 
the opposite is observed (tables 1 and 4). The position effect hypothesis fares 
even worse if one takes into account the total geographic distribution of the 
gene arrangements. Standard is frequent on the Pacific Coast, rapidly declines 
in frequency as one proceeds eastward, is rare in the Great Basin and the 
Rocky Mountains, and absent in Mexico and Guatemala. Arrowhead is the 
predominant gene arrangement in the Great Basin and declines in frequency 
westward, eastward, and southward from there. Chiricahua reaches the highest 
known frequencies in the state of Chihuahua, Mexico, and in southern Cali- 
fornia (DOBZHANSKY and STURTEVANT 1938 and unpublished data). An anal- 
ysis of the gross geographic distribution of the gene arrangements has led 
EPLinG (unpublished) to the conclusion that it reflects the history of the 
species rather than the contemporary environments prevailing in different 
parts of the species area. 

The facts are in a much better agreement with the view that the variations 
in the adaptive values observed among chromosomes with different gene 
arrangements result from chance association of mutant genes with now one 
and now another gene arrangement. The establishment of such an association 
can easily be visualized, provided that the species is broken up into colonies 
isolated by distance or by distribution barriers. A mutant gene which is ad- 
vantageous at a certain season arises in a chromosome with, for example, the 
Standard arrangement. The descendants of this chromosome may attain a 
considerable frequency within the colony, and if the gene in question lies in the 
part of the chromosome in which crossing over is rare in inversion heterozy- 
gotes, the association may persist for a long time. In another colony the same 
mutant gene, however, may arise in a chromosome with a different gene 
arrangement. Thus, in some colonies a gene favorable at a given season may 
be associated with chromosomes with the Standard arrangement, in other 
colonies with Arrowhead, and in still others no such association may exist. 
The fact that most chromosomes in natural populations of Drosophila pseudo- 
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TABLE 5 


Observed frequencies of “‘sex-ratio” in different samples. SR—frequency in percentage, 
n—number of chromosomes examined. 








DATE 


SR 


n 





Andreas A, Spring 1939 





Mar 26 31.2 16 
Apr 24 23-7 93 
May 13 17.8 45 
Jun 4 11.1 27 
Total 21.0 181 





Andreas A, 1939-1940 





Sep 21 14.2 113 
Oct 28 17.1 123 
Dec 9 16.7 84 
Jan 13 14.8 128 
Feb 10 13.3 113 
Mar 2 32.5 231 
Mar 28 10.6 106 
Apr 20 10.9 92 
May 19 23.1 39 
Total 14.0 1029 





Andreas A, 1940-1941 





Oct 109 15.0 60 
Nov 24 15.2 33 
Dec 31 0.0 12 
Feb 1 7.8 77 
Feb 109 7.1 56 
Mar 8 13.7 153 
Apr 19 20.3 69 
Total 12.8 460 





Andreas A, 1941-1942 














Sep 6 11.4 79 
Oct 4 11.1 18 
Nov 8 35-7 98 
Dec 6 9.6 83 
Jan 11 8.3 96 
Feb 2 3.0 33 
Mar 14 9.8 41 
Apr 2 19.2 73 
May 2 19.3 83 
Jun 12 23.1 78 
Tota! 16.9 682 
Andreas A 15.1 2352 
Andreas B, 1939 
Apr 24 18.5 27 
May 13 28.3 53 
Jun 4 23-4 47 
Total 24-4 127 





Andreas B, 1939-1940 





Pinon A, 1942 




































































Sep 21 14.8 128 Apr 2 18.0 78 
Oct 28 9-9 III May 2 7.1 84 
Dec 9 15.8 57 Jun 12 12.4 81 
Jan 13 16.4 67 Jul 17 23-9 88 
Feb 10 19.8 96 
Mar 2 10.2 187 Total 15.4 331 
Mar 28 12.9 85 — 
Apr 20 15.3 85 Pinon A 12.6 2070 
May 109 7.2 14 
Pinon B, 1939 
Total 13.6 830 Apr 24 6.0 16 
May 13 15.3 124 
Andreas B 15.1 057 Jun 21 26.9 26 
Aug 10 8.5 71 
Andreas I§.I 3309 Sep 19 8.3 72 
a Oct 21 9.1 187 
Pinon A, 1939 ae. 
Apr 24 8.7 23 Total II.1 496 
May 13 8.6 116 
Jun 21 11.6 86 Pinon B, 1940 
Aug 19 7-1 28 Mar 3 17.1 41 
Sep 19 7-6 106 Mar 29 10.2 98 
Oct 21 11.5 183 Apr 2t 14.3 63 
May 10 18.8 85 
Total 9.8 542 Jun 1 15.7 83 
Pinon A, 1940 Total 14.9 370 
Mar 3 11.2 107 
Mar 29 9.6 104 Pinon B 12.7 866 
Apr 21 6.6 61 
May 10 15.2 92 Pinon 12.6 2936 
Jun 1 11.4 44 
Sep 14 13.2 76 Keen A, 1939 
Nov 2 7-7 39 Apr 24 7.1 14 
May 13 17.0 47 
Total II.1 523 Jun 20 17.0 135 
Jul 25 17.7 113 
Pinon A, 1941 Aug 31 17-7 51 
Mar 7 13.7 117 Oct 22 23.3 30 
Apr 19 13.2 151 
May 10 14.3 84 Total 17.4 390 
Jun 15 8.9 56 — 
Jun 28 23.2 82 Keen A, 1940 
Aug 5 16.3 98 Apr 13 8.7 92 
Oct 4 24-0 25 May 18 10.2 88 
Nov 8 9-4 32 Jun 8 11.9 42 
Dec 6 3-4 29 Jul 4 22.1 77 
Aug 12 20.0 80 
Total 14.5 674 Sep 14 5.6 18 
Total 14.1 397 
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DATE SR n DATE SR n DATE SR n 
Keen A, 1941 Keen C, 1939 Keen D, 1941 
May 10 14-5 117 May 20 21.3 47 May 10 14.8 101 
Jun 5 14.3 49 Jun 2 21.7 69 Jun 5 15.7 89 
Jun 25 17.1 76 Jun 29 14.9 74 Jul 7 17.7 85 
Jul 27 15.8 38 Jul 25 16.7 66 Aug 12 7-9 101 
Sep 6 21.9 41 Sep 1 20.0 15 
_ —_ Total 13.8 376 
Total 16.2 321 Total 18.4 271 
—_- Keen D 15.1 1009 
Keen A, 1942 Keen C, 1940 —_— 
Apr 2 14.9 14 Apr 14 7-9 63 Keen E, 1939 
May 2 6.8 74 May 11 12.4 97 May 20 14.3 14 
Jun 12 15.9 88 Jun 7 16.7 18 Jun 3 22.6 115 
Jul 17 15.6 77 Jun 29 16.1 31 Jul 14 16.1 62 
Jul 29 23.8 21 Aug 18 18.6 86 
Total 13-4 313 5 
Total 13.0 230 Total 19.5 277 
KeenA 15.3 1421 _- 
Keen C 16.0 501 Keen E, 1940 
Keen B, 1939 Apr 13 £4.28 112 
Apr 25 14.3 14 Keen D, 1939 May 11 14-7 95 
May 13 11.8 17 May 20 20.0 25 Jun 7 18.1 83 
May 20 12.5 64 Jun 3 16.5 127 Jun 29 21.6 37 
Jun 20 16.0 50 Jul 14 11.6 112 Jul 14 15.3 59 
Jun 29 21.1 52 Sep 1 14.3 42 Aug 12 12.5 48 
Jul 26 25.6 78 
Aug 31 17.7 51 Total 14.7 306 Total 15.4 434 
Total 18.4 326 Keen D, 1940 Keen E 17.0 711 
— Apr 14 13.6 88 — - SUE 
Keen B, 1940 May 18 20.8 96 Keen 16.2 4250 
May 11 23.4 04 Jun 29 18.2 66 
Jun 8 20.0 55 Jul 29 14-3 77 
Jun 29 19.1 63 as 
Jul 29 18.6 70° Total 16.8 327 
Total 20.6 282 





Keen B 19.4 608 





obscura contain deleterious recessives (DOBZHANSKY, Ho1z, and SPASSKY 1942) 
will, of course, counteract too high an increase in frequency of the descendants 
of an individual chromosome, even if that chromosome contains a mutant 
which is per se favorable. The optimum condition is probably the presence in 
a population of a variety of chromosomes with different gene arrangements 
and different gene contents. A similar conclusion has been arrived at through 
a different argumentation by StuRTEVANT and MATHER (1938). 


THE “SEX-RATIO” 


In the populations of Mount San Jacinto the gene arrangement is constant 
in chromosomes other than the third and the X. Two types of X chromosomes 
occur: the Standard and the “sex-ratio.” Males which carry “sex-ratio” pro- 
duce almost unisexual progenies. The situation with the “sex-ratio” gene 
arrangement is peculiar in that, as first shown by Gershenson (1928), it should 
increase in frequency automatically until the species becomes homozygous for 
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it. Actually, its frequency in populations of Drosophila pseudoobscura varies 
from zero to about 30 percent. The level to which the “sex-ratio” is allowed to 
accumulate is evidently controlled by natura] selection, although precisely 
how selection acts to restrict the spread of the “sex-ratio” is unknown (STURTE- 
VANT and DoBZHANSKY 1936). Variations in the frequency of such a character 
in different localities and at different seasons, however, are not unexpected. 

A summary of the data on the occurrence of “sex-ratio” in populations of 
Mount San Jacinto is presented in table 5. In all, 10,495 X chromosomes have 
been examined from the three localities. The “sex-ratio” has been found at all 
the stations and in all but two very small samples. Its frequency, however, 
appears to vary a great deal. The variations are analyzed in table 6. Stations 
within a locality are alike in the incidence of the “sex-ratio.” No significant 
changes either within or between breeding seasons have been observed at the 


TABLE 6 


Variations in the frequency of the “sex-ratio” within a breeding season, at different breeding seasons, 
and at different stations within a locality. 











ANDREAS PINON KEEN 

x? df 4 x? = df lg x? df P 
Within season 74.78 36 <o.o1 45-64 31 0.05-0.02 55-47 51 0.60 
Between seasons 19.39 4 <o.0O1 12.41 4 0©.02-0.01 9-17. 8 0.50-0.30 
Stations ©.005 I 0.95-0.90 0.011 I 0.95-0.90 6.70 4 0.20-0.10 





Keen stations, but at Andreas and at Pinon changes do occur. Thus, the be- 
havior of the “sex-ratio” parallels that of the gene arrangements in the third 
chromosome. In both instances the variations within a breeding season are 
apparently the primary phenomenon, while the variations between the seasons 
are in part, but only in part, accountable for by the incomplete regularity of 
the sampling in different months and in different years. 

The situation is clarified further by breaking up the data according to the 
month but regardless of the year of collecting (table 7). The behavior of the 
“sex-ratio” is analogous to that of the Chiricahua gene arrangement in the 
third chromosome: at Andreas and at Pinon its frequency increases in late 
spring, reaches a maximum in May, June, or July, and drops thereafter to its 
winter level. The changes are statistically significant: for Andreas x?= 47.86, 
df=o, P negligible; for Pinon x?=17.94, df=8, P between 0.05 and 0.02. The 
data are consistent throughout, except for a very high value for the month of 
November at Andreas (table 7); this value depends, however, upon a single 
aberrant sample. No significant changes occur at Keen: x?=10.77, df=5, P 
between o.10 and 0.05, or else slight changes of the same type as at Andreas 
and Pinon. The total frequency of “sex-ratio” at Keen (16.2 percent) is greater 
than at Pinon (12.6 percent), while Andreas is intermediate (15.1 percent). 
The differences are statistically significant (x?=15.45, df=2, P<o.or), but 
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they cannot be considered established because of the errors introduced by the 
intra- and inter-seasonal changes it the separate stations and localities. 


OTHER DATA 


Aside from the body of data presented above, there exist other data which 
show that the relative frequencies of gene arrangements may vary with time. 
DoBZHANSKY and STURTEVANT (1938) observed a significant change between 
autumn and the following spring in a population of San Gabriel Canyon, 
California. This population, however, has not been studied further. A popu- 


TABLE 7 


Frequency of the “Sex ratio” (in percentage) in different months. 











ANDREAS PINON KEEN 

SR n SR n SR n 
January rg.2 291 _ — _ _— 
February 12.0 375 _ — _— _ 
March 12.2 819 11.8 467 _ — 
April 17.8 439 12.5 392 11.8 457 
May 20.9 234 13.2 585 15.3 976 
June ar.1 152 t<.2 458 17.7 1319 
July _ = 23.9 88 17-3 935 
August _ _ 12.2 197 15.3 417 
September 13.8 20 : 254 
Pr a va Pg 395 ai - 
November 30.5 131 = — 
December *3.% 236 {r-0 = _— _ 





lation sample taken in 1938 at Wildrose Canyon, Panamint Mountains, Cali- 
fornia, by KOLLER (1939) differed in composition from that taken in 1937 by 
DoszHANSKY in the same neighborhood. Further samples were taken at four 
collecting stations in Wildrose Canyon in 1939 and 1940. No new changes have 
been observed, but it is to be noted that the collecting was done always in late 
May or early in June. Proressor J. T. PATTERSON has very generously sent 
to the writer for examination the population samples taken at frequent inter- 
vals in 1939, 1940, and 1941 at Aldrich, near Austin, Texas. In Texan popula- 
tions the commonest gene arrangement in the third chromosome is Pikes Peak, 
Arrowhead is less common, and Tree Line, Olympic, and Estes Park occur as 
admixtures (for descriptions of these gene arrangements see DoBZHANSKY and 
STURTEVANT 1938). A total of 1279 third chromosomes have been examined. 
Slight but significant changes occur at Aldrich: the relative frequency of 
Arrowhead is highest in March and declines as the season advances, while 
Pikes Peak displays a seasonal trend which is the reciprocal of that of Arrow- 
head. Furthermore, in 1939 Arrowhead was less and Pikes Peak was more 
common at Aldrich than in 1940. In 1941 the species, according to Patterson, 
was abnormally rare; the small sample obtained showed a very low frequency 
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of Arrowhead. The differences between the seasonal totals for 1939 and 1940 
are statistically significant. 


DISCUSSION 


Until recently biologists generally conceded that evolutionary changes, 
apart from those engendered by artificial selection and by polyploidy, are too 
slow to be observed directly. Instances of rapid transformation, however, have 
accumulated to such an extent that this concession is no longer necessary (for 
a review see DOBZHANSKY 1941). These instances are roughly of three sorts: 
changes in microorganisms, emergence of new varieties of wild species adapted 
to conditions altered by man (the best example of this being the cyanide re- 
sistant forms of scale insects), and cyclic changes connected with the alterna- 
tion of the seasons. To the last class belong the remarkable observations of 
TIMOFEEFF-RESSOVSKY (1940) on Adalia bipunctata, which reveal a situation 
singularly like that in Drosophila pseudoobscura. Adalia bipunctata is a beetle 
which occurs in several color phases determined by a series of alleles of a 
certain gene. The populations of any one locality are usually mixtures of 
several color phases; populations of different geographical regions differ as a 
rule in the relative frequencies of the same color types which can be found in 
any one of them, although certain types are geographically restricted. In the 
environs of Berlin, the darkly pigmented types increase in relative frequency 
from spring to autumn, but, owing to a greater mortality of the dark phases 
during the hibernation, the lighter types become again more frequent in 
spring. 

It is evident that in at least some species the genetic composition of popula- 
tions is more plastic than it was believed hitherto. The pressure of the environ- 
ment varies in intensity as well as in kind in different localities and in different 
seasons; some populations are able to respond to these variations by startlingly 
rapid reconstructions of their genotype. As shown above, the frequency of the 
Chiricahua gene arrangement in the population of the Andreas Canyon is 
about doubled between March and June of each year (table 4). This time 
interval can hardly correspond to more than two generations. Even if every 
chromosome with the Chiricahua gene arrangement contains genetic factors 
favorable in the summer environment (which may well be doubted), a change 
so swift as the above indicates a selection coefficient of an unexpectedly high 
magnitude (s approximately 0.4). The drop in the frequency of the Chiricahua 
arrangement during the last summer at Andreas is almost as precipitous as 
its rise in spring. Selection coefficients of the order s=o.1 must be assumed to 
operate to bring about the observed changes in the frequencies of the gene 
arrangements in the third chromosome and of the “sex-ratio” at Andreas and 
Pinon Flats. Yet, it is customary to think that the adaptive changes in race 
and species formation are built with the aid of much smaller selective advan- 
tages. 

The changes observed in Adalia and Drosophila seem to follow the seasonal 
climatic cycle. If they are constantly recurring and completely reversible, they 
represent, from the point of view of long range evolutionary progress, largely a 
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wasted motion. It may even be that species undergoing changes as rapid as 
these never actually reach the highest theoretically possible adaptive levels: 
the species may at any one time be adapted best to the conditions which have 
just lapsed. Such a situation is likely to obtain in many organisms which in- 
habit the Temperate Zone, and in general any country with pronounced sea- 
sonal climatic changes. Organisms which produce several generations per year 
will be most subject to this unending race with the environment, but an essen- 
tially similar situation may obtain also in organisms with long developmental 
periods. 


SUMMARY 


In 1939 and 1940 population samples were taken at approximately monthly 
intervals at nine collecting “stations” in three “localities” on Mount San 
Jacinto, in California. In 1941 four of these stations, and in 1942 three stations, 
were followed. The relative frequencies of the various gene arrangements in 
the third and X chromosomes in these samples have been determined. In all, 
12,807 third and 10,495 X chromosomes have been examined. 

In the populations of at least two of the three localities the relative frequen- 
cies of the gene arrangements change from month to month. The changes are 
regular and cyclic. They seem to follow the annual climatic cycle. 

The composition of the population of a locality may change also from year 
to year, but such changes, as far as known, are not as striking as the seasonal 
ones. 

The populations of stations within a locality are alike at any one time. The 
populations of localities ten to fifteen miles apart may be permanently differ- 
ent. 

It is surmised that the changes observed are caused by natural selection 
favoring the carriers of different gene arrangements at different seasons of the 
year. Chance associations of the gene arrangements with genetic factors pro- 
ducing physiological modifications, rather than position effects, are probably 
responsible for the phenomena observed. 
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INTRODUCTION 


WO mutant alleles have been found in the house mouse which segregate 

normally in heterozygous females, while heterozygous males generally 
produce a significant excess of gametes containing one of these mutant factors. 
The inheritance and some of the developmental effects of these factors have 
been described (CHESLEY and DuNN 1936; DUNN 1937; GLUECKSOHN-SCHOEN- 
HEIMER 1938; DUNN and GLUECKSOHN-SCHOENHEIMER 1939; KOBOZIEFF and 
POMRIASKINSKY-KOBOZIEFF 1939a, 1939b), and it has been shown that each 
mutant is lethal when homozygous, interacts with a third dominant factor T 
to produce complete taillessness, and affects the segregation ratio in all 
heterozygous males. The effects associated with the principal combinations of 
the three alleles are as follows: 

TT—lethal, dies at 11 days; T+ —-short tail (“Brachy”) 

2°t°’—lethal, dies at five to six days; T?’—tailless; +-/°—normal tail 

“t—lethal, dies before five days; T¢#'—tailless; +-#—normal tail 

t°t'\—-viable, normal tail; males entirely sterile 

The dominant T segregates normally in both sexes; females T?°, +?°, T?', 
+t' show normal segregation, but the corresponding males (sibs with other- 
wise similar genotype) generally produce an excess of offspring which inherit 
t° or '. The amount of excess of ¢° or # varies with individual males. Thus one 
male T?° gave a progeny ratio (when tested by ++) of 122 #°:9 T, equality ex- 
pected; while other males varied from this to a progeny ratio of 93 #°:42 T. 
Males T#, +#°, and +# show similar distortions of segregation. The aberrant 
ratios have been shown to be characteristic of ¢° and # males and to be truly 
genotypic and not merely phenotypic ones. There is evidence that they are not 
due to differential viability or to errors in classification (DUNN and GLUECK- 
SOHN-SCHOENHEIMER 1939). It has been assumed that ?/° and #' produce ab- 
normalities in spermatogenesis resulting in an excess of spermatozoa with ?° 
or #, and this is being investigated further. 

Genetical evidence already published (DUNN and GLUECKSOHN-SCHOEN- 
HEIMER 1939) suggested that the degree of aberration of an individual male 
might be determined or affected by his genetic constitution with respect to 
factors other than #! or ¢®. A number of sons of a “low ratio” (97 #:8 T) anda 
“high ratio” (38 ¢':13 T) male of genotype 7? were tested and showed a tend- 
ency to repeat the father’s type of aberrancy. 


PLAN OF EXPERIMENTS 


The sons of the same father, however, differed significantly among them- 
selves. From this beginning, five generations of male descendants of the “low 
ratio” male have now been tested with results which are reported below. The 
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object of the experiment was to test whether or not variations in the ratio of 
i! progeny have a genetic basis. If they have, then successive groups of brothers 
descended from one male should resemble each other in their ¢' ratios, and by 
selection one type of aberrancy should become fixed, or at least the ratios from 
brothers should become more homogeneous in successive generations of selec- 
tion. 

The segregation ratio of + to #' is most readily revealed by mating +?! to 
Brachy (T+). The progeny of such a cross consist of three phenotypes: normal 
(++, +), Brachy (T+), and tailless (T?'). The last two each receive T from 
the Brachy parent, while one receives +, the other /' from the +7 parent. 
The progeny ratio of Brachy to tailless should therefore represent the ratio of 
+ to ¢' gametes formed by the +#' parent. When the +?' parent is a female, the 
ratio is always near to the expected one—that is, 2 normal tail: 1 Brachy:1 tail- 
less. When the +/' parent is a male, the ratio of Brachy to tailless is almost 
never 1:1 but varies up to 1 Brachy: 20 tailless or even more. For example, the 
progeny of the 63 +/' males tested by Brachy females for the purpose of this 
experiment consisted of 2543 normal-tailed, 577 Brachy, 1932 tailless. Here the 
ratio of normal-tailed to abnormal (Brachy plus tailless) is 2543:2509, indi- 
cating normal disjunction of + and T in the Brachy mothers; while the ratio 
of Brachy to tailless indicates a disturbed segregation of + and #' in the fathers. 

The changes in the variance of the +:?' ratios of +/ males may therefore 
be measured by testing the heterogeneity of the ratios of Brachy to tailless 
progeny from groups of +?#' brothers in successive generations. This was begun, 
as stated above, by mating ten normal sons (+7') of a tailless (T#) male with 
Brachy females (7+), both males and females having been derived from four 
backcrosses to the same standard inbred stock. 


NEW DATA 


The actual ratios produced by the ten F; males tested were shown in table 6 
in the paper of DuNN and GLUECKSOHN-SCHOENHEIMER (1939) but are repro- 
duced in table 1 herewith, for comparison with the ratios produced by sibling 
groups of their male descendants. The females to which all of these males were 
crossed were genetically similar; they were short-tailed animals (T+) derived 
from several backcrosses (always two or more) to a standard inbred stock 
(Bagg albino). They were not entirely isogenic, but an attempt was made to 
randomize possible differences among the females by breeding each female in 
succession to the several males whose ratios were being compared in any gen- 
eration. Other evidence indicates that the mother has little if any influence on 
the ratio of #1 and + among her offspring, this ratio being determined by the 
father. Differences among the females, however, might influence the ratios 
produced by their sons. In practice all the sons of one or two males per genera- 
tion were saved for testing. Since any one mother produces relatively few sons, 
the sons of different mothers were saved. Actually, then, the sibling groups 
consist of sets of full brothers and of half brothers, as indicated in table 1. The 
degree to which these differences in relationship may influence the hetero- 
geneity in sibling groups may be estimated by comparing groups of full 
brothers with the sibling groups as a whole. 
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TABLE I 





Heterozygous male descendants (+1) of o 3290, tailless (T#), tested for variation in the segrega- 
tion ratio of + and t'. Groups of + brothers and half brothers crossed with Brachy (T+-) females 
derived from backcrosses to inbred Bagg albino stock. Progenies of fewer than 30 individuals have 
been omitted. Full brothers are designated by braces {. Males whose sons were tested are shown in 











italics. 
NORMAL 
MALE NO. FATHER TAIL ere — TOTALS of 
Peg Tt (T+:T#) 
Fi 3598 3290 98 7 78 183 
3761 . 41 12 33 76 
3764 e 35 12 29 76 
3860 ° 83 18 77 177 - 
3861 , 48 16 33 97 > 
3863 . 27 5 17 49 
3864 “ 66 18 56 140 -_ 
4006 3 23 3 II 37 
4010 z 54 15 45 114 
Totals 521 118 408 1047 


Heterogeneity of Brachy:tailless x= 17.10; p=.05 











F2 4429 F, 3598 7° 4 65 139 
4474 ig 52 5 38 95 
4084 +i 57 6 62 125 
4782 vi 31 4 17 52 
4796 : 53 12 51 116 
Totals 263 31 233 527 
Heterogeneity: x?= 7.2; p=.13 
Fs 5085 F, 4782 23 5 5 33 
5086 . 114 13 120 247 
5145 ° 19 3 20 42 
Totals 156 21 145 322 
Heterogeneity x?= 13.62; p=.oo1 
Fs; 5081 F. 4684 37 6 27 7° 
5127 z 48 10 43 96 
5162 = 32 5 19 56 
Totals 117 21 89 222 
Heterogeneity x?=.1; p=.95 
F; 5088 F, 4796 72 29 66 167 
Fy | 5855 F; 5086 39 12 21 72 
5856 . 37 II 30 78 <2 
6058 m 35 10 21 66 ‘ 
6059 18 9 9 36 
5885 " 32 10 28 7° 
5886 ” 66 38 34 138 
6072 “s 45 18 18 81 << 
6073 si 43 3 40 86 
5997 r 37 6 27 70 
5935 35 II 22 68 
Totals 387 128 250 765 


Heterogeneity x?= 37.35; p= <.0or 
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TABLE 1—Continued 














NORMAL 
MALE NO. FATHER TAIL gg ar si TOTALS (T+ ™#) 
+++, +F i 

Fe 5741 F; 5088 29 5 28 62 
5772 = 21 7 17 45 
5941 ‘ 40 13 37 go 7 
5943 x 43 6 24 73 
5835 f 12 4 14 30 
5922 , 41 13 24 78 
5923 : 39 12 23 74 
5925 : 60 13 42 = ? 
Totals 285 73 209 567 
Heterogeneity x?=6.39; p= .50 

F. {5304 F; 5162 39 6 47 “ — 
5658 a 26 10 15 51 ‘ 
5662 ” 45 14 33 82 
Totals 110 30 95 235 
Heterogeneity x?= 8.75; p=.o1 

Fe {5757 Fs 5127 55 6 38 _ - 

“ : 
5759 41 13 24 78 
Totals 96 19 62 177 
Heterogeneity x?=5.17; p=.02 

Fs | 6380 Fy 5886 38 6 24 68 
6664 - 41 8 24 73 - 
6474 ” 24 I 16 4! 

6475 . 28 10 12 50 eid 
6576 z 38 7 27 72 

6878 : 34 17 25 76 

Totals 203 49 128 380 
Heterogeneity x?= 11.71; p=.02 (6474 and 6380 taken together) 

Fs }) 6372 Fy 5925 32 7 34 73 6 
6520 26 I 29 56 7 
6394 . 26 7 25 58 
6 “ ° 7 

395 41 9 23 73 
6385 33 I 30 44 
6401 . 35 10 29 74) 
6402 = 20 6 17 43 “3 
6403 4 32 11 14 57 
= ” 20 I 17 38 
6581 . 50 10 40 100 ro 
6670 9 30 6 28 64 
6748 7 26 5 14 45 
6818 ° 35 13 19 67 
Totals 404 87 319 810 


Heteroteneity x?= 31.94; p= <.oo1 (6520, 6385 and 6582 taken together) 





Tests of significance of differences among the ratios of sons of the same male 
were made by calculating a x? for heterogeneity according to the method of 
BRANDT and SNEDECOR (FISHER 1941, p. 85). Wherever class frequencies of 
1 and 2 were encountered (for example, Fs; 06474) the data for such males 
were combined with those from another brother in order to eliminate the low 
frequencies which introduce errors in the calculation of x? for the sibship. 
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Among the ten sibships (groups of sons) belonging to the F,-F, generation 
derived from one T?' male, the ratios of Brachy to tailless in the progenies 
show a significant heterogeneity in seven sibships. Three sibships (an F: of 
five brothers, an F; of three brothers, and an Fy, of eight brothers) yielded 
values of P greater than 0.05 and hence were substantially homogeneous. 
There was no tendency for the ratios to become more homogeneous as selection 
proceeded, both groups of sons in Fs showing significant ratio differences. 
When the Brachy-to-tailless ratios of the ten groups were compared, a sig- 


TABLE IA 
Summary of Table 1 








VALUE OF P 





BRACHY:T. 
GROUPS OF BROTHERS COMPARED ( ennai nars0s) 





ALL FULL 
BROTHERS BROTHERS 





GEN. NO. FATHER NORMAL BRACHY TAILLESS TOTAL 





F; 3290 521 118 408 1047 +05 +05; .02 

F; 3598 263 31 233 527 +13 

F; 4782 156 21 145 322 -OoI 

F; 4684 117 21 84 222 -O5 

Fy 5086 387 128 250 765 <.001 .5; .oo1 

Fy 5088 285 73 209 567 $0 8 «73 «3 

Fy 5162 III 30 04 235 OL <.o1 

Ky 5127 96 19 62 177 -02 .02 

F; 5886 203 49 128 380 -02 -7; -006 

F; 5925 404 87 319 810 <.00t .06 .%5 .35 oF 
Totals 2543 577 1932 5052 <.001 





nificant heterogeneity was likewise found (P=.oo01). These calculations show 
that the ratios produced by sons of the same male may be quite diverse. 

The groups of sons compared consist actually of groups of half-brothers, 
since all sons were not derived from the same mother. There are, however, 14 
groups of two or more full brothers. These are the groups enclosed in brackets 
in table 1. The values of x? and of P were calculated for each group, and the 
P value is shown at the left of each group in table 1. Four of the groups con- 
tained one member in which the frequency of the rare class (Brachy) was less 
than five—namely, 6073, 6474, 6520 and 6582. Comparison of these with 
a full brother was made by calculating P exactly from a 2X2 contingency 
table according to the method described by FISHER 1941, p. 94—that is, 
P=(a+b)! (c+d)! (a+c)! (b+d)!/n! a! b! cld! 

Of the 14 values of P obtained, seven indicate a significant heterogeneity 
among full brothers (table 1a). These values occur chiefly in the fourth and 
fifth generations, and there is no indication that full brothers tended to become 
any more similar during the selection experiment. 

The amount of heterogeneity is about as great among the full brother groups 
as among the sibships generally, indicating that possible differences among the 
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mothers are probably not very important in determining this heterogeneity 
in the ratios. 

The differences in the ratios produced appear to persist in spite of consider- 
able genetical similarity among the males tested. This genetical similarity is 
inferred from the descent of all the males from one male through matings with 
animals derived from the same inbred stock. Although this does not ensure 
isogeneity, it makes it very unlikely that genetical differences influencing the 
ratios should be distributed in such a way that half of all groups of full brothers 
should show significant differences within the fraternity. On the contrary, the 
data appear to favor the assumption that the differences between the brothers 
are due to non-genetic causes. 

There is as yet no evidence as to what these causes are, but they are prob- 
ably connected with another peculiarity of males heterozygous for ?¢' or ?#°. It 
has been found, from tests of +é' males for example, that the offspring which 
inherit + and # are not distributed at random, but that those inheriting + 
tend to occur in groups or clusters within a litter (DUNN and GLUECKSOHN- 
SCHOENHEIMER 1939). This non-random distribution of one type of gamete 
may cause significant differences in the ratios of two different males, one of 
which produces a “cluster,” the other of which does not. “Clustering” has been 
found in connection with the segregation of /°, #1, and / (a probable recurrence 
of #', DUNN 1939) and only in connection with those factors, which are also the 
only three factors in mice known to be subject to variation in segregation ratios 
like those reported in this paper. 

For the present we may conclude that the variations among related males 
in the ratios in which the factor ¢' segregates from its alternative condition are 
probably non-genetic. 

SUMMARY 


Among males segregating for a lethal factor /', variations in the ratio of + 
to #! were found when males +/' were tested by Brachy (short-tailed T+) 
females. Such variations persisted among groups of brothers descended 
through five generations of selection from the same single male. The con- 
tinuance of such significant heterogeneity among brothers was taken as an 
indication that these variations in segregation ratios are non-genetic. 
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